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We classify Lie-Poisson brackets that are formed from Lie algebra extensions. The 
problem is relevant because many physical systems owe their Hamiltonian structure 
to such brackets. A classification involves reducing all brackets to a set of normal 
forms, independent under coordinate transformations, and is achieved with the tech- 
niques of Lie algebra cohomology. For extensions of order less than five, we find that 
the number of normal forms is small and they involve no free parameters. A special 
extension, known as the Leibniz extension, is shown to be the unique "maximal" 
extension. 

We derive a general method of finding Casimir invariants of Lie-Poisson 
bracket extensions. The Casimir invariants of all brackets of order less than five 
are explicitly computed, using the concept of coextension. We obtain the Casimir in- 
variants of Leibniz extensions of arbitrary order. We also offer some physical insight 
into the nature of the Casimir invariants of compressible reduced magnetohydrody- 
namics. 

We make use of the methods developed to study the stability of extensions 
for given classes of Hamiltonians. This helps to elucidate the distinction between 



semidirect extensions and those involving cocycles. For compressible reduced mag- 
netohydro dynamics, we find the cocycle has a destabilizing effect on the steady-state 
solutions. 
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Chapter 1 
Introduction 



The topic of this thesis is the classification and analysis of the properties of Lie- 
Poisson brackets obtained from extensions of Lie algebras. A large class of finite- 
and infinite-dimensional dynamical equations admit a Hamiltonian formulation using 
noncanonical brackets of the Lie-Poisson type. Finite-dimensional examples include 
the Euler equations for the rigid body [9], the moment reduction of the Kida vor- 
tex [65], and a low-order model of atmospheric dynamics [14]. Infinite-dimensional 
examples include the Euler equation for the ideal fluid [52,61,67,72,78], the quasi- 
geostrophic equations [39,94], and the Vlasov equation [60,66]. 

In mathematical terms, Lie-Poisson brackets naturally define a Poisson struc- 
ture (i.e., a symplectic structure [95]) on the dual of a Lie algebra. For the rigid 
body, the Lie algebra is the one associated with the rotation group, SO(3), while 
for the Kida vortex moment reduction the underlying group is SO(2,l). For the 
two-dimensional ideal fluid, the relevant Lie algebra corresponds to the group of 
volume-preserving diffeomorphisms of the fluid domain. 

Lie-Poisson structures often occur as a result of reduction [59]. Reduction 
is, in essence, a method of taking advantage of the symmetries of a system to lower 
its order. However in so doing one perhaps loses the canonical nature of the system: 
there are no longer any well-defined conjugate positions and momenta. This does not 
preclude the system from being Hamiltonian, that is these conjugate variables can 
exist locally, up to some possible degeneracy in the system (the symplectic leaves). 
The resulting Hamiltonian system (after reduction) is often of Lie-Poisson type. For 



example, the reduction of the rigid body in Euler angle coordinates (three angles and 
three canonical momenta, for a total of six coordinates) gives Euler's equations (in 
terms of only the angular momenta, three coordinates), which have a Lie-Poisson 
structure. 

Why seek a bracket formulation of a system at all? If we care about whether 
a system is Hamiltonian or not, then for noncanonical systems it is a simple way of 
showing that the equations have such a structure. We are then free to use the power- 
ful machinery of Hamiltonian mechanics. For example, we know that the eigenvalue 
spectrum of the linearized system has to have four-fold symmetry in the complex 
plane [6]. If we are concerned with the properties of the truncation of a hydrody- 
namic system, then knowing the bracket formulation can serve as a guide for finding a 
finite-dimensional representation of the system which retains the Hamiltonian struc- 
ture [64,97]. Also, there exists moment reductions — finite-dimensional subalgebras 
of infinite-dimensional algebras — that provide exact closures [63-65,83]. 

We will classify low-order bracket extensions and find their Casimir invariants. 
An extension is simply a new Lie bracket, derived from a base algebra (for example, 
SO (3)), and defined on n-tuples of that algebra. We are ruling out extensions where 
the individual brackets that appear are not of the same form as that of the base 
algebra. We are thus omitting some brackets [70,72,77], but the brackets we are 
considering are amenable to a general classification. 

The method of extension yields interesting and physically relevant algebras. 
Using this method we can describe finite-dimensional systems of several variables 
and infinite-dimensional systems of several fields. For the finite-dimensional case, an 
example is the two vector model of the heavy top [40] , where the two vectors are the 
angular momentum an the position of the center of mass. For infinite-dimensional 
systems there are examples of models of two [12,64,73], three [32,51,73], and four [33, 
70] fields. Knowing the bracket allows one to find the Casimir invariants of the 
system [36,50,91]. These are quantities which commute with every functional on 



the Poisson manifold, and thus are conserved by the dynamics for any Hamiltonian. 
They are useful for analyzing the constraints in the system [90] and for establishing 
stability criteria [31,38,68,69,71]. 

1.1 Overview 

The outline of the thesis is as follows. In Chapter 2, we review the general theory 
behind Lie-Poisson brackets. We give some examples of physical systems of Lie- 
Poisson type, both finite- and infinite-dimensional. We introduce the concept of Lie 
algebra extensions and derive some of their basic properties. Chapter 3 is devoted 
to the more abstract treatment of extensions through the theory of Lie algebra co- 
homology [19,21,47]. We define some terminology and special extensions such as 
the semidirect sum and the Leibniz extension. In Chapter 4, we use the cohomol- 
ogy techniques to treat the specific type of extension with which we are concerned, 
brackets over w-tuples. We give an explicit classification of low-order extensions. By 
classifying, we mean reducing — through coordinate changes — all possible brackets to 
independent normal forms. We find that the normal forms are relatively few and 
that they involve no free parameters — at least for low-order extensions. 

In Chapter 5, we turn to the problem of finding the Casimir invariants of the 
brackets, those functionals that commute with every other functional in the algebra. 
We derive some general techniques for doing this that apply to extensions of any 
order. We treat explicitly some examples, including the Casimir invariants of a 
particular model of magnetohydrodynamics (MHD), which are also given a physical 
interpretation. A formula for the invariants of Leibniz extensions of any order is also 
derived. Then in Section 5.6 we use the classification of Section 4.6 to derive the 
Casimir invariants for low-order extensions. 

We address general stability of Lie-Poisson systems in Chapter 6. We begin 
by reviewing the concept of stability in Section 6.1, discussing the distinctions be- 
tween spectral, linearized, formal, and nonlinear stability. We consider the difficulties 



that arise for infinite-dimensional systems. In Section 6.2 we present a review of the 
energy-Casimir method for finding equilibria and establishing sufficient conditions 
for stability. We use the method on compressible reduced MHD. In Section 6.3, we 
turn to a more general method for stability analysis, that of dynamical accessibility. 
The method uses variations that have been restricted to symplectic leaves. We then 
treat several different classes of Hamiltonian and Lie-Poisson brackets and discuss 
the role of cocycles in equilibria and their stability. Finally, in Chapter 7 we offer 
some concluding remarks and discuss future directions for research. 



Chapter 2 
Lie— Poisson Brackets 



Lie-Poisson brackets define a natural Poisson structure on duals of Lie algebras. 
Physically, they often arise in the reduction of a system. For our purposes, a reduction 
is a mapping of the dynamical variables of a system to a smaller set of variables, 
such that the transformed Hamiltonian and bracket depend only on the smaller 
set of variables. (For a more detailed mathematical treatment, see for example [1, 
10,28,57-59].) The simplest example of a reduction is the case in which a cyclic 
variable is eliminated, but more generally a reduction exists as a consequence of 
an underlying symmetry of the system. For instance, the Lie-Poisson bracket for 
the rigid body is obtained from a reduction of the canonical Euler angle description 
using the rotational symmetry of the system [40]. The Euler equation for the two- 
dimensional ideal fluid is obtained from a reduction of the Lagrangian description of 
the fluid, which has a relabeling symmetry [16,69,76,80]. 

Here we shall take a more abstract viewpoint: we do not assume that the 
Lie-Poisson bracket is obtained from a reduction, though it is always possible to do 
so by the method of Clebsch variables [69] . Rather we proceed directly from a given 
Lie algebra to build a Lie-Poisson bracket. The choice of algebra can be guided by 
the symmetries of the system. 

In Section 2.1, we give some definitions and review the basic theory behind 
Lie-Poisson brackets. We then give examples in Section 2.2: the free rigid body, 
reduced magnetohydrodynamics (RMHD), and compressible reduced magnetohydro- 
dynamics (CRMHD). These last two cases are examples of Lie algebra extensions. We 



describe general Lie algebra extensions in Section 2.3. This introduces the problem, 
and establishes the framework for the remainder of the thesis. 

2.1 Lie Poisson Brackets on Duals of Lie Algebras 

Recall that a Lie algebra g is a vector space on which is defined a bilinear opera- 
tion [, ] :gxg-»g, called the Lie bracket. The Lie bracket is antisymmetric, 

[a, (3} = -[/3,a], 

and satisfies the Jacobi identity, 

[a, [/?, 7 ]] + [/3, [ 7 , a ]] + [7, [a,0]]=O, 

for arbitrary elements a, (3, 7 in g. Lie algebras are differentiable manifolds. 

A real-valued functional defined on a differentiable manifold Ai is simply a 
map from Ai to R. (From now on, when we say functional it will be understood 
that we mean a real- valued functional.) The vector space of all functionals on A\ is 
denoted by F{M). 

The dual g* of g is the set of all linear functionals on g. The elements of g* 
are denoted by 

<£,.>: fl -R, <£,->eg*, 

where £ identifies the elements of g*. It is customary, however, to simply say £ € g* 
and express the pairing by ( , ) : g* x g — ► R. This simplifies the procedure of iden- 
tifying g and g*, especially for infinite-dimensional Lie algebras, where the pairing 
is typically an integral. Note that functionals can be defined on g*, since it is a dif- 
ferentiable manifold. In finite dimensions, g and g* are isomorphic as vector spaces 
(they have the same dimension). However, g* does not naturally inherit a Lie algebra 
structure from g. In infinite dimensions, the two spaces need not be isomorphic. 



Let M be a differentiable manifold. A Poisson structure on J-(M) is a Lie 
algebra on J-(M) with bracket { , } that satisfies the derivation property 

{F , GH} = {F,G}H + G{F,H} 1 

where F, G, H £ J-(M). (This property is also called the Leibniz rule.) The mani- 
fold M with the bracket { , } is called a Poisson manifold. 

For the remainder of the thesis, we will be interested in the case where Ai is 
the dual q* of a Lie algebra q. The Lie-Poisson bracket provides a natural Poisson 
structure on T(q*), given the Lie bracket [ , ] in q. It is defined as 



{F,G} ± (Z) = ±U 



6F_ 6G 



(2.1) 



where F and G are real- valued functionals on q* , that is, F, G : q* — > R, and £ 6 0*. 
The functional derivative 5F/5^ £ g is defined by 



SF[Z;6Z]:=±F[Z + e6Z] 



e =0 \ ^^ 



We shall refer to the bracket [ , ] as the inner bracket and to the bracket { , } as 
the Lie-Poisson bracket. The dual q* together with the Lie-Poisson bracket is a 
Poisson manifold. The sign choice in (2.1) comes from whether we are considering 
right invariant (+) or left invariant (— ) functions on the cotangent bundle of the Lie 
group [58,61], but for our purposes we simply choose the sign as needed. 

For finite-dimensional algebras, the Lie-Poisson bracket (2.1) was first written 
down by Lie [54] and was rediscovered by Berezin [13]; it is also closely related to 
work of Arnold [5], Kirillov [46], Kostant [48], and Souriau [86]. 

Before we can describe the dynamics generated by Lie-Poisson brackets, we 
need a few more definitions. The adjoint action of q on itself is the same as the 
bracket in q, 

ad a j3 = [ a , (3 ] , 



where a, (3 € g. From this we define the coadjoint action ad^ of g on g* by 1 

(adU,/?):=<£,ad Q /?), (2.3) 

where ^£g*. We also define the coadjoint bracket [ , ] . g x g* —> g* to be 

[a,e] f :=adU, (2-4) 

so that 

([<*,£]* ,/3):=U> [«,/3]>; (2-5) 

the bracket [ , ]* satisfies the identity 

Since the inner bracket is Lie, it satisfies the Jacobi identity, and consequently 
the form given by (2.1) for the Lie-Poisson bracket will automatically satisfy the 
Jacobi identity [2, p. 614]. This is proved in Appendix A. 

We are of course interested in generating dynamics from the Lie-Poisson 
bracket. This is done in the usual manner, by inserting a Hamiltonian functional in 
the bracket. For any Poisson structure, given a Hamiltonian functional H : A4 — ► M, 
the equation of motion for £ £ M is 

where a dot denotes a time derivative. For a Lie-Poisson bracket, we have M = g*, 
and we use the definition (2.1) of { , } to write 



e = u 






1 We are using the convention of Arnold [9, p. 321], but some authors define ad^ with a minus 
sign, so that the canonical bracket and its coadjoint bracket have the same sign in (2.12) when g 
and g* are identified. 



where A is a Kronecker or Dirac delta, or a combination of both for an infinite- 
dimensional system of several fields (that is, £ can be a vector of field variables). We 
then use the definition of the coadjoint bracket (2.5), 



£ 



SH ri 
1$ 



,A>, 



and finally use the property of the delta function to identify £ with the left slot of 
the pairing, 



- e 



(2.6) 



Thus, for Lie-Poisson brackets the dynamical evolution of £ is generated by the 
coadjoint bracket. 

We close this section by commenting on the nature of the dynamics generated 
by Lie-Poisson brackets. The elements of a Lie algebra g are usually regarded as 
infinitesimal generators of the elements of a Lie group G near the identity. (We also 
say that the Lie algebra is the tangent space of the Lie group at the identity.) The 
coadjoint orbit through £ £ g* is defined as 



Orb(£) := JAd,U| aeG\. 



(We will not rigourously define it here, but simply think of Ad^ : g* — ► g* as a finite 
version of the infinitesimal coadjoint action adl : g* — ► g*. See for example Arnold [9, 
pp. 319-321].) The coadjoint orbits tell us what parts of q* can be reached from a 
given element £* by acting with the group elements. For example, the coadjoint 
orbits for the rotation group 50(3) are spheres [58, p. 400], so two elements of g* 
belong to the same coadjoint orbit if they lie on the same sphere (the elements can 
be mapped onto each other by a rotation). 

The infinitesimal generator at £ of the coadjoint action is 

»fo.(0:=ad{£ (2.7) 
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Comparing this to the equation of motion (2.6), and recalling the definition of the 
coadjoint bracket (2.4), we see that £ lies along the direction of the infinitesimal 
generator adl„,r, at £. 

What does this all mean? The time-evolved trajectory {£(£) | t > 0} of £ 
must go through points in q* that can be reached by Ad„£(0), where £(0) is an 
initial condition, for some a £ G. To put it more succinctly, 

{£(*) I t > 0} C Orb(£(0)). (2.8) 

For G = SO(3), since the coadjoint orbits are spheres then the only trajectories 
allowed must lie on spheres. This makes SO(3) the natural group to describe the 
motion of the rigid body, as we will see in Section 2.2.1. Note that equality in (2.8) 
does not usually hold, since the trajectory is one-dimensional, whereas the coadjoint 
orbits are usually of higher dimension. 

2.2 Examples of Lie— Poisson Systems 

We will say that a physical systems can be described by a given Lie-Poisson bracket 
and Hamiltonian if its equations of motion can be written as (2.6) for some H; 
the system is then said to be Hamiltonian of the Lie-Poisson type. We give four 
examples: the first is finite-dimensional (the free rigid body, Section 2.2.1) and the 
second infinite-dimensional (Euler's equation for the ideal fluid, Section 2.2.2). The 
third and fourth examples are also infinite-dimensional and serve to introduce the 
concept of extension. They are low-beta reduced magnetohydrodynamics (MHD) 
in Section 2.2.3 and compressible reduced MHD in Section 2.2.4. These last two 
examples are meant to illustrate the physical relevance of Lie algebra extensions. 

2.2.1 The Free Rigid Body 

The classic example of a Lie-Poisson bracket is obtained by taking for q the Lie 
algebra of the rotation group SO(3). If the eu\ denote a basis of q = so(3), the Lie 
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bracket is given by 

[e W ,e (i) ] =c£-e (fc) , 

where the cjl- = £ijk are the structure constants of the algebra, in this case the totally 
antisymmetric symbol. Using as a pairing the usual contraction between upper and 
lower indices, with (2.1) we are led to the Lie-Poisson bracket 

U,g\- cx 3 *k di . dt .> 

where the three- vector £ is in g* , and we have chosen the minus sign in (2.1). The 
coadjoint bracket is obtained using (2.3), 

1(3,4 = -c^H k . (2.9) 

If we use this coadjoint bracket and insert the Hamiltonian 

H = ±(r 1 fe i e j (2.10) 

in (2.6) we obtain 

Notice how the moment of inertia tensor / plays the role of a metric — it allows 
us to build a quadratic form (the Hamiltonian) from two elements of g* . If we 
take I = diag(ii, I2, I3), we recover Euler's equations for the motion of the free rigid 
body 

'> = (£-£)**• 

and cyclic permutations of 1,2,3. The It are the angular momenta about the axes 
and the Ii are the principal moments of inertia. This result is naturally appealing 
because we expect the rigid body equations to be invariant under the rotation group, 
hence the choice of 50(3) for G. 



12 



2.2.2 The Two-dimensional Ideal Fluid 

Consider now an ideal fluid with the flow taking place over a two-dimensional do- 
main fi. Let 5 be the infinite-dimensional Lie algebra associated with the Lie group of 
volume-preserving diffeomorphisms of £1. In two spatial dimensions this is the same 
as the group of canonical transformations on 0,. The bracket in g is the canonical 
bracket 

da db db da 
dx dy dx dy' 

We formally identify g and g* by using as the pairing ( , ) the usual integral over the 
fluid domain, 



{F,G)= /F(x)G(x)d 2 x, 
Jn 



where x := (x,y). For infinite-dimensional spaces, there are functional analytic issues 
about whether we can make this identification, and take g** = g. We will assume here 
that these relationships hold formally. Sec Marsden and Weinstein [57] for references 
on this subject and Audin [10] for a treatment of the identification of g and g* . 

For simplicity, we assume that the boundary conditions are such that surface 
terms vanish, and we get 



it 



(2.12) 



from (2.5). (Without this assumption the coadjoint bracket would involve extra 
boundary terms.) We take the vorticity u as the field variable £ and write for the 
Hamiltonian 



H[w] = -i<w,V 



UJ 



where 



(V- 2 kj)(x) := / K(x|x')o;(x / )dV, 
Jn 
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and K is Green's function for the Laplacian. The Green's function plays the role of 
a metric since it maps an element of g* (the vorticity w) into an element of g to be 
used in the right slot of the pairing. This relationship is only weak: the mapping K 
is not surjective, and thus the metric cannot formally inverted (it is called weakly 
nondegenerate). When we have identified g and 0* we shall often drop the comma 
in the pairing and write 

H[u;] = -±(u;<P) = l(\VcP\ 2 ), 

where to = V 2 defines the streamfunction </>. We work out the evolution equation 
for uj explicitly: 

Sco (x) 5H 



Mx') ' M x ') 



dV 



,/M A2 X , 



lu(x) = {lu,H}= f w(x') 
Jn 

= /u,(x')[<5(x-x'),-<Kx')] d 2 
Jn 

= f 5(x-x!)[w(x!),<f>(x!)] dV 
= Kx),^(x)]. 

This is Euler's equation for a two-dimensional ideal fluid. We could also have written 
this result down directly from (2.6) using [,]=—[,]• 

2.2.3 Low-beta Reduced MHD 

This example will illustrate the concept of a Lie algebra extension, the central topic 
of this thesis. Essentially, the idea is to use an algebra of n-tuples, which we call an 
extension, to describe a physical system with more than one dynamical variable. As 
in Section 2.2.2 we consider a flow taking place over a two-dimensional domain f2. 
The Lie algebra q is again taken to be that of volume preserving diffeomorphisms 
on S7, but now we consider also the vector space V of real-valued functions on f2 (an 
Abelian Lie algebra under addition). The semidirect sum of g and V is a new Lie 
algebra whose elements are two-tuples (a, v) with a bracket defined by 



](a,v),((3,w)] := ([a, (3] , [a ,w] - [(3, v\) 



(2.13) 
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where a and /3 £ Q, v and w £ V. This is a Lie algebra, so we can use the prescription 
of Section 2.1 to build a Lie-Poisson bracket, 

Let cj = V 2 be the (scalar) parallel vorticity, where (j) is the electric potential, tp 
is the poloidal magnetic flux, and J = X7 2 ip is the poloidal current. (We use the 
same symbol for the electric field as for the streamfunction in Section 2.2.2 since 
they play a similar role.) The pairing used is a dot product of the vectors followed 
by an integral over the fluid domain (again identifying q and q* as in Section 2.2.2). 
The Hamiltonian 



H[u-^\ = l -f (|V0| 2 + |VVf) d 2 x 



with the above bracket leads to the equations of motion 

u = [u,<t>] + [?p,J} , 

(2.14) 

i> = [i>A] ■ 

This is a model for low-beta reduced MHD [73,87,98], obtained by an expansion in 
the inverse aspect ratio e of a tokamak, with e small. With a strong toroidal magnetic 
field, the dynamics are then approximately two-dimensional. The model is referred 
to as low-beta because the electron beta (the ratio of electron pressure to magnetic 
pressure, see (2.16)) is of order e 2 . 

For high-beta reduced MHD, the electron beta is taken to be of order e. 
There is then an additional advected pressure variable, which couples to the vorticity 
equation, and the system still has a semidirect sum structure [35,88]. 

Benjamin [12] used a system with a similar Lie-Poisson structure, but for 
waves in a density-stratified fluid. Semidirect sum structures are ubiquitous in ad- 
vective systems: one variable (in this example, 4>) "drags" the others along [90]. 
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2.2.4 Compressible Reduced MHD 

In general there are other, more general ways to extend Lie algebras besides the 
semidirect sum. The model derived by Hazeltine et al. [33-35] for two-dimensional 
compressible reduced MHD (CRMHD) is an example. This model has four fields, 
and as for the low-beta reduced MHD system in Section 2.2.3 it is obtained from an 
expansion in the inverse aspect ratio of a tokamak. It includes compressibility and 
finite ion Larmor radius effects. The Hamiltonian is 



H[»,v,j>M = \J q (|V^| 2 + . 2 + (P ^ eX)2 +|V^| 2 ) d 2 x, 



(2.15) 



where v is the ion parallel (toroidal) velocity, p is the electron pressure, 2 /3 e is the 
electron beta, 



Pe:= 



2T C 

VA 2 



(2.16) 



a parameter that measures compressibility, va is the Alfven speed, and T c is the 
electron temperature. The other variables are as in Section 2.2.3. The coordinate x 
points outward from the center of the tokamak in the horizontal plane and y is the 
vertical coordinate. The motion is made two-dimensional by the strong toroidal 
magnetic field. The bracket we will use is 



{f - g} =JM 



p 



5F 5G 

(_l y |_ Slu ' Su 

5F 5G 



8lo ' 5p 



+ 



+ v 

5£ 6G 

5p ' 5lo 



5£ 6G_ 

5oj ' 5v 



+ 



+ ip 



5£ 5G 

5v ' 5io 
<5F 6G 

5lo ' 5ip 



+ 



5F_ 5G_ 

5ip ' 5uj 



- Pe^ 



5F_ 6G 
Sp ' 5v 



+ 



5F_ 5G_ 

Sv ' 5p 



)) 



d 2 x. (2.17) 



2 The variable p is actually a deviation of the pressure from a linear gradient. The total pressure 
is p = p — 2/3 c x. 
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Together this bracket and the Hamiltonian (2.15) lead to the equations 

u = [w,<l>] + [il>,J]+2[p,x] 

v = [v,<j)] + [ij,p]+2f3 c [x,iP] 

P = [P,4>] +Pe[lJ>,v] 

which reduce to the example of Section 2.2.3 in the limit of v = p = (3 C = (when 
compressibility effects are unimportant). In the limit of (5 C = 0, the parallel velocity 
decouples from the other equations, and we recover the three equations of high-beta 
reduced MHD for uj, tp, and p [35]. 

It is far from clear that the Jacobi identity is satisfied for (2.17). A direct 
verification is straightforward (if tedious), but we shall see in Section 2.3 that there 
is an easier way. 

2.3 General Lie Algebra Extensions 

We wish to generalize the types of bracket used in Sections 2.2.3 and 2.2.4. We build 
an algebra extension by forming an n-tuple of elements of a single Lie algebra 0, 

a:= (ai,...,a n ) , (2.18) 

where a; Sg. The most general bracket on this n-tuple space obtained from a linear 

combination of the one in q has components 

n 

[<*»/?]* = E W ^ K>/^] , A = l,...,n, (2.19) 

where the W\^ v are constants. (From now on we will assume that repeated indices 
are summed unless otherwise noted.) Since the bracket in q is antisymmetric the W } s 
must be symmetric in their upper indices, 

W x ^ = W x ^ • (2.20) 
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This bracket must also satisfy the Jacobi identity 

[«, [0,7}] X + [P, [7,a]] A + [ 7 , [a,/3]] x = 0, X = l,...,n. 

The first term can be written 

[a,[P,l}}x = W x ^W a ^[a T ,[P fl , lu ]}, 

which when added to the other two gives 

W x aT Wr ([or , [/3 M , 7 ,]] + [/3r, [7^,^]] + [7r, [ a„, A,]]) = 0. 

We cannot yet make use of the Jacobi identity in g: the subscripts of a, (3, and 7 
are different in each term so they represent different elements of g. We first relabel 
the sums and then make use of the Jacobi identity in g to obtain 

(w x ° r wr - w x ™ w a T n [ or , [ /^ , 7 „ ] ] 

+ W ^</ r - W x ™ WS*) [ /?„ , [7, , ocr ] } = . 

This identity is satisfied if and only if 

W x aT W ff ^ = W x au W a T ^ , (2.21) 

which together with (2.20) implies that the quantity W\ T W a ^ v is symmetric in all 
three free upper indices. If we write the Ws as n matrices W'"' with rows labeled 
by A and columns by fi, 



11 ■- W x ^, (2.22) 

A 



then (2.21) says that those matrices pairwise commute: 

W (u) W (a) = w (a) w (y) _ ^^ 

Equations (2.20) and (2.23) form a necessary and sufficient condition: a set of n 
commuting matrices of size n x n satisfying the symmetry given by (2.20) can be 
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used to make a good Lie algebra bracket. From this Lie bracket we can build a 
Lie-Poisson bracket using the prescription of (2.1) to obtain 



We now return to the two extension examples of Sections 2.2.3 and 2.2.4 and 
examine them in light of the general extension concept introduced here. 



2.3.1 Low-beta Reduced MHD 

For this example we have (^°,^ 1 ) = (io,tp), with 





1 



The reason why we start labeling at will become clearer in Section 4.4. The 
two W^' must commute since W^- ' = I, the identity. The tensor W also satisfies 
the symmetry property (2.20). Hence, the bracket is a good Lie algebra bracket. 



2.3.2 Compressible Reduced MHD 

We have n = 4 and take (£° , £* , £ 2 , £ 3 ) = (uj,v,p,ip), so the tensor W is given by 



W<® 



W^ 



f 1 






1 










1 





5 


Vo 





V 









0\ 



1 








\o 


-A, 





0/ 



w^ 



w^ 



(° 








°\ 


1 

















Vo 





-A, o/ 


(° 





0\ 
























V 





0/ 





(2.24) 



It is easy to verify that these matrices commute and that the tensor W satisfies 
the symmetry property, so that the Lie-Poisson bracket given by (2.17) satisfies the 
Jacobi identity. (See Section 4.4 for an explanation of why the labeling is chosen to 
begin at zero.) 
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Figure 2.1: Schematic representation of the 3-tensor W for compressible reduced 
MHD. The blue cubes represent unit entries, the red cubes are equal to — /? e , and 
all other entries vanish. The vertical axis is the lower index A of W\ fJ,u , and the 
two horizontal axes are the symmetric upper indices \x and v. The origin is at the 
top-rear. 



The 3-tensor W can be represented as a cubical array of numbers, in the same 
way a matrix is a square array. In Figure 2.1 we show a schematic representation 
of W for CRMHD. The blocks represent nonzero elements. 



Chapter 3 
Extension of a Lie Algebra 



In this chapter we review the theory of Lie algebra cohomology and its application 
to extensions. This is useful for shedding light on the methods used in Chapter 4 
for classifying the extensions. However, the mathematical details presented in this 
chapter can be skipped without seriously compromising the flavor of the classification 
scheme of Chapter 4. Most necessary mathematical concepts will be defined as 
needed, but the reader wishing more extensive definitions may want to consult books 
such as Azcarraga and Izquierdo [21] or Choquet-Bruhat and DeWitt-Morette [20]. 

3.1 Cohomology of Lie Algebras 

We now introduce the abstract formalism of Lie algebra cohomology. Historically 
there were two different reasons for the development of this theory. One, known as 
the Chevalley-Eilenberg formulation [19], was developed from de Rham cohomology. 
de Rham cohomology concerns the relationship between exact and closed differential 
forms, which is determined by the global properties (topology) of a differentiable 
manifold. A Lie group is a differentiable manifold and so has an associated de Rham 
cohomology. If invariant differential forms are used in the computation, one is led 
to the cohomology of Lie algebras presented in this section [20,21,47]. The second 
motivation is the one that concerns us: we will show in Section 3.2 that the extension 
problem — the problem of enumerating extensions of a Lie algebra — can be related to 
the cohomology of Lie algebras. 

Let g be a Lie algebra, and let the vector space V over the field K (which we 
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take to be the real numbers later) be a left g-module, 1 that is, there is an operator p : 
gxV^V such that 

Pa {V + V 1 ) = p a V + p a v', 
Pa+a' V = p a V + p a > V, 
P[a,a'] v = [Pa,Pa>] V, (3.1) 

for a, a' £ g and v, v' £ V. The operator p is known as a left action. A g-module gives 
a representation of g on V . The action p defines a Lie algebra homomorphism from g 
to the algebra of linear transformations on V. A Lie algebra homomorphism / : q — > a 
is a linear mapping between two Lie algebras q and o which preserves the Lie algebra 
structure, that is 

f([<x,P] B ) = [f(<x),f(P)] a > «>/3e . 

An n-dimensional V- valued cochain u> n for g, or just n-cochain for short, is 
a skew-symmetric n-linear mapping 

cu n : gx gx ■■■ x g — > V. 

Cochains are Lie algebra cohomology analogues of differential forms on a manifold. 
Addition and scalar multiplication of n-cochains are defined in the obvious manner 

by 

(u> n + uj' n ){ui, ...,a n ):= u n (ati, . . . ,a n ) + w' n (ai, . . -,a n ), 
(auj n )(ai,... ,a n ) := ow n (ai,... ,a n ), 

where a±, . . . , a n G g and a G K. The set of all n-cochains thus forms a vector space 
over the field K and is denoted by C n (g, V). The 0-cochains are defined to be just 
elements of V, so that C°(g, V) = V. 



1 When V is a right g-module, we have p\ a>a n — — [ p a , Pa' ] ■ The results of this section can be 
adapted to a right action by changing the sign every time a commutator appears. This sign choice 
is for similar reasons as that of (2.1). 
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The coboundary operator is the map between cochains, 

s n :C n ( 9 ,V)^C n+1 ( 9 ,V), 
defined by 

n+l 

(s n W„)(«l,... ,Qn+l) := 2_/-) l+1 /9a^n(ai,--- ,«i,... , «n+l) 

i=l 
n+l 

+ ^ (-)' ?+fc ^n([aj ,afc] ,«i,... ,aj, ...,dfe,... ,a n +i), 
j<k 
where the caret means an argument is omitted. We shall often drop the n subscript 
on s n , deducing it from the dimension of the cochain on which s acts. 

We shall make use mostly of the first few cases, 

(svo)(ai)=p ai uo, (3.2) 

(suii)(ai,a 2 ) = p ai vi(a2) - Pa 2 vi(ai) - wi([ai , a 2 ]), (3.3) 
(suj 2 )(ai,a 2 ,a 3 ) = p ai w 2 (a 2 ,a 3 ) + p a2 w 2 (a 3 , ai) + p a:i w 2 («i, a 2 ) 

- w 2 ([ai ,a 2 ] ,03) -w 2 ([ai2 , a 3 ] ,ai) -a; 2 ([a 3 ,ai] ,a 2 ) . (3.4) 

It is easy to verify that sto n defines an (n + l)-cochain, and it is straightforward 
(if tedious) to show that s n +is n = s 2 = 0. For this to be true, the homomorphism 
property (3.1) of p is crucial. 

An n-cocycle is an element u n of C n (Q,V) such that s n to n = 0. An n- 
coboundary uj co \^ is an element of C n (g, V) for which there exists an element w n _i 
of C n_1 (g, V) such that uj co \, = sw n _i. Note that all coboundaries are cocycles, but 
not vice-versa. 

Let 

Z^g,V) = keis n 
be the vector subspace of all n-cocycles, Z™(q, V) C C n (g, V), and let 

B'p(g,V) = range s n _i 



■z\ 



be the vector subspace of all n-coboundaries, B™(q,V) C C h (q,V). The nth coho- 
mology group of with coefficients in V is defined to be the quotient vector space 

H;( e ,V):=Z^ 9 ,V)/B;( 9 ,V). (3.5) 

Note that for n > dim , we have #™(g, V) = Z™(g, V) = B%(q, V) = 0. This is be- 
cause one cannot build a nonvanishing antisymmetric quantity with more indices 
than the dimension of the space (at least two of the indices would always be equal, 
which implies that the quantity is zero). 

3.2 Application of Cohomology to Extensions 

In Section 2.3 we gave a definition of extension that is specific to our problem, in 
terms of the tensors W . We will now define extensions in a more abstract manner. 
We then show how the cohomology of Lie algebras of Section 3.1 is related to the 
problem of classifying extensions. In Chapter 4 we will return to the more concrete 
concept of extension, of the form given in Section 2.3. 

Let fi'-Qi—> 0i+i be a collection of Lie algebra homomorphisms, 

• • • *- Qi *- Qi+i *- Qi+2 »- • • • . 



By the homomorphism property of /j, we have 

/i([a,i9] ft ) = [/i(a) J /iC9)] fc+lJ a,(3e 5i . 
The subscript on the brackets denotes the algebra to which it belongs. 

The sequence fi is called an exact sequence of Lie algebra homomorphisms if 

range fi = ker f i+1 . 

Let Q, t), and o be Lie algebras. The algebra f) is said to be an extension of g 
by o if there is a short exact sequence of Lie algebra homomorphisms 

> a ^U f, - 1 - ^ o • (3.6) 
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The homomorphism i is an insertion (injection), and n is a projection (surjection). 
We shall distinguish brackets in the different algebras by appropriate subscripts. 
We also define r : q — > f) to be a linear mapping such that ix o r = li (the identity 
mapping in g). Note that r is not unique, since the kernel of n is not trivial. Let (3 £ 
h, ?? G o; then 

*"[/?, *?7]f, = [n/3,Tvii]] g = 0, 

using the homomorphism property of n and it o i = 0, a consequence of the ex- 
actness of the sequence. Thus [P ,irj]^ £ ker-zr = range i, and i a is an ideal in f) 
since [f3 ,irj] £ ia. Hence, we can form the quotient algebra fj/a, with equivalence 
classes denoted by (3 + o. By exactness tt{(5 + a) = ir (3, so g is isomorphic to \)/a and 
we write q = t)/a. 

Though i a is a subalgebra of f), rg is not necessarily a subalgebra of t), for 
in general 

[ra,T(i\ /r[a,/3] , 

for a, f3 £ q; that is, r is not necessarily a homomorphism. The classification problem 
essentially resides in the determination of how much r differs from a homomorphism. 
The cohomology machinery of Section 3.1 is the key to quantifying this difference, 
and we proceed to show this. 

To this end, we use the algebra o as the vector space V of Section 3.1, so 
that a will be a left g-module. We define the left action as 

p a f] ■= r 1 [ra,i7]} t) (3.7) 

for a £ q and r] £ a. For a to be a left g-module, we need p to be a homomorphism, 
i.e., p must satisfy (3.1). Therefore consider 

[Pa,Pp]V= (paPfS ~ Pf3pa) V 

= p a r l [r ,ir)}t,- Ppi~ l [r a ,ir)\ 



r 1 



t(x,[t (5 ,irj]h ~ i X t (3 ,[t a ,ir] 
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which upon using the Jacobi identity in f) becomes 



.Pa,P/3\ri = i 



[ra,T/3],j,«77 

r[a,(3] if] 



+ r 



P[a,t3]V + i 



-1 



^([Ta-T^-rlo,^],) ,177] (3.8) 
([ra,r/3]^-r[a,/3] fl J ,ir? 



By applying 7r on the expression in parentheses of the last term of (3.8), we see that it 
vanishes and so is in ker it, and by exactness it is also in i 0. Thus the f) commutator 
above involves two elements of i 0. We define w : g x g -> by 



u(a,(3):=i x \[tu ,t/3][, - r [a,{3] g j . 
The mapping i~ l is well defined on i a. Equation (3.8) becomes 

[p a ,Pi3]r) = p[ a ,f3] r) + [w{a,l3),r}] a . 



(3.9) 



(3.10) 



Therefore, p satisfies the homomorphism property if either of the following is true: 



(i) is Abelian, 



(ii) r is a homomorphism, 

Condition (i) implies [ , ] = 0, while condition (ii) means 

\toc,tP\ = r[a,/3] g , 

which implies u> = 0. If either of these conditions is satisfied, a with the action p 
is a left g-module. We treat these two cases separately in Sections 3.3 and 3.4, 
respectively. 



3.3 Extension by an Abelian Lie Algebra 

In this section we assume that the homomorphism condition (i) at the end of Sec- 
tion 3.2 is met. Therefore a is a left g-module, and we can define o-valued cochains 
on g. In particular, to defined by (3.9) is a 2-cochain, to G C 2 (g,a), that measures 
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the "failure" of r to be a homomorphism. We now show, moreover, that w is a 
2-cocycle, uo G Z^(g,a). By using (3.4), 

(su)(a,/3,j) = p a u(/3,j) + p/3Uj{-f,a) +p 7 a;(a,/3) 

-w([a,/3] fl ,7) -w([/3,7] fl ,a) -w([7,a] flJ /3), 
= r 1 [ra,iw(/3,7)]^-u;([a,/3] g ,7) + eye. perm., 

where we have written "eye. perm." to mean cyclic permutations of a, (3, and 7. 
Using the definition (3.9) of lo, we have 



(sw)(o:,/3,7) =i x Ta,[r/3,r7] h -r[/3,7] 



,— 1 



,— 1 



+ i 1 t 



r[a,(3] & ,T-f 

ra,[r/3,T7] f) 



[a,(3] a ,l 



+ eye. perm. 



+ eye. perm. = 0. 



eye. perm. 



The first parenthesis vanishes by the Jacobi identity in f), the second by the Jacobi 
identity in g, and the other terms were canceled in pairs. Hence wisa 2-cocycle. 

Two extensions f) and f)' are equivalent if there exists a Lie algebra isomor- 
phism a such that the diagram 



-»- 



i) 







(3-11) 



*-0 



V 



is commutative, that is if a o % = i' and n = tt' o g. 

There will be an injection r associated with ir and a r' associated with ir', 
such that 7T o r = li = 7r' o r'. The linear map v = (t~ 1 t' — r must be from 5 to i a, 
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so i 1 v G C 1 (q, a). Consider p and pi respectively denned using r, i and r', i' by (3.7). 
Then 

„•/-! r _/ 



(Pa-p' a )v = i V«.«l]|-i' [t'c*,^] , 



i Hraji?)],.-? V 1 [cr(i/ + r) a^crir/b, 

(3.12) 



i 1 [t a ,ir]]. — i 1 [(v + t) a ,ir] 



h 



-i 1 [va,ir)] t) = 0, 



since a is Abelian. Hence, r and r' define the same p. Now consider the 2-cocycles to 
and u>' defined from r and r' by (3.9). We have 

v'{a,(3) -oj(a,/3) = i'^ 1 [[r'a,T'(3] v -r'[a,/3] fl J 
-r 1 ([ra,r/3][,-r[a,/3] g J 
= rV -1 ([<r(i/ + r) a , ct(^ + r) /3] h , - (r(*/ + r) [a ,/?] fl ) 
-r 1 ^[ra,r/?]^-r[a,/?] g J 

= f 1 |[(i/ + T)a,(i/ + T)|3] r !/[a^] g -[ra,T/3] f) j 
= r 1 ([ra^/3],, + [i/a,r /?](,- f [a, /3] J 
= p a (i _1 ^/3) - pfi{ir x va) - i _1 ^[Q,/3] g . 

Comparing this with (3.3), we see that 

J -u = s(r 1 v), (3.13) 

so u) and ui' differ by a coboundary. Hence, they represent the same element in H^(q, a). 
Equivalent extensions uniquely define an element of the second cohomology group H^{q, a). 
Note that this is true in particular for f) = f)', a = 1, so that the element of H^q, a) 
is independent of the choice of r. equivalent 

We are now ready to write down explicitly the bracket in f) . We can represent 
an element a £ f) as a two-tuple: a = {a\, 02) where a± £ g and 02 G a (f| = g © o as a 
vector space). The injection i is then i 02 = (0, 02), the projection it is ir (ai, 02) = «i, 
and since the extension is independent of the choice of r we take r a± = (a±, 0). By 
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linearity, 

[a,/?]„ = [(a 1 ,0),(/? 1 ,0)]„ + [(0,a 2 ),(0,/3 2 )]„ 

+ [(a 1 ,0),(0,p 2 )] i) + [(0,a 2 ),(P 1 ,0)] i) . 

We know that [ (0, a 2 ), (0, /3 2 ) ]h = since a is Abelian. By definition of the cocycle u>, 
Eq. (3.9), we have 

[(ai,0),(/3i,0)]j, = [rai,r/3i]^ 

= ia;(ai,/?i) +r[ai,/3i] fl 
= ([ai,/?i] e , w(ai,/?i)). 

Finally, by the definition of p, Eq. (3.7), 

[(ai,0),(0,/3 2 )] f) = [ra 1 ,i/3 2 ] f) = / 9 Ql /? 2 , (3.14) 

and similarly for [ (0, a 2 ), (/3i, 0) ],., with opposite sign. So the bracket is 

[«,/?](, = (["i,/3i] g , Pqi/?2 -pp 1 a 2 +u(ai,(3 1 )). (3.15) 

As a check we work out the Jacobi identity in fy: 



a,[f3,Y. 



[ai,[P,j] 1 ] s ,p ai [/3,7] 2 -p[/3, 7 ] i a 2 +a;(Qi,[/3,7] 1 )j 

"1 , [ A , 71 ]« , Pai (P/3 X 72 - P 7 i #2 + w(/?i, 7l)) 
J 

-p[/?i,7i] B a 2 + w(a;i,[/3i,7i] g )). 



Upon adding permutations, the first component will vanish by the Jacobi identity 
in g. We are left with 

<X, [P, 7]f, + C y C - P erm - = (°> (PaiP/?i -pfiiPai -P[ai,/3i]J72 

+ PaiW(/5i>7i) -w([«i ,/?i] fl >7i)J + c y c - perm., 

which vanishes by the the homomorphism property of p and the fact that w is a 
2-cocycle, Eq. (3.4). 
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Equation (3.15) is the most general form of the Lie bracket for extension 
by an Abelian Lie algebra. It turns out that the theory of extension by a non- 
Abelian algebra can be reduced to the study of extension by the center of a, which is 
Abelian [21]. We will not need this fact here, as the only extensions by non- Abelian 
algebras we will deal with are of the simpler type of Section 3.4. 

We have thus shown that equivalent extensions are enumerated by the second 
cohomology group i^?(g, a). The coordinate transformation a used in (3.11) to define 
equivalence of extensions preserves the form of and a as subsets of t). However, 
we have the freedom to choose coordinate transformations which do transform these 
subsets. All we require is that the isomorphism a between rj and f/ be a Lie algebra 
homomorphism. We can represent this by the diagram 

»- a — ^ f) -^-^ > (3.16) 



0' — *->• i)' -^ 0' 0. 

The primed and the unprimed extensions are not equivalent, but they are isomor- 
phic [96, p. 199]. Cohomology for us is not the whole story, since we are interested 
in isomorphic extensions, but it will guide our classification scheme. We discuss this 
point further in Section 4.3. 

Diagrams (3.11) and (3.16) are related to the "Short Five Lemma," which 
states that if the diagram of Lie algebra homomorphisms 




is commutative, with the top and bottom rows exact, then 

(i) 7, 5 monomorphisms ==£- a monomorphism; 
(ii) 7, S epimorphisms =^> a epimorphism; 
(iii) 7, 5 isomorphisms =^> a isomorphism. 
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A monomorphism is injective, an epimorphism is surjective, and an isomorphism is 
bijective. The important point is that the converse of the Lemma is not true: if a 
is an isomorphism then it says nothing about the properties (or even the existence) 
of 7 and 5. Note that (hi) follows immediately from (i) and (ii). The proof can be 
found in Mac Lane and Birkhoff [55] or Hungerford [42], for example. 

3.4 Semidirect and Direct Extensions 

Assume now that to defined by (3.9) is a coboundary. By (3.13) there exists an 
equivalent extension with u = 0. For that equivalent extension r is a homomorphism 
and condition (ii) at the end of Section 3.2 is satisfied. Thus the sequence 

I) ^— < (3.17) 

is an exact sequence of Lie algebra homomorphisms, as well as the sequence given 
by (3.6). We then say that the extension is a semidirect extension (or a semidirect 
sum of algebras) by analogy with the group case. More generally, we say that f) splits 
if it is isomorphic to a semidirect sum, which corresponds to to being a coboundary, 
not necessarily zero. If o is not Abelian, then (3.12) is not satisfied and two equivalent 
extensions (or two different choices of r) do not necessarily lead to the same p. 

Representing elements of f) as 2-tuples, as in Section 3.3, we can derive the 
bracket in 1) for a semidirect sum. The difference is that r is a homomorphism so 
that 

[(a 1 ,0),((3 1 ,0)] i) = [Ta 1 ,T(3 1 ] t) = T[a 1 ,/3i] g = ([«i ,/?i] ,O), 

and a is not assumed Abelian, 

[(0,a 2 ),(0,/3 2 )][, = [ion ,i fo]^ = i[a 2 , fo] a = (0 ,[a 2 , fola), 

which together with (3.14) gives 

[a,(3] h = {[a 1 ,(3 1 ] g , p ai fa - P/3 1 a 2 + [a 2 ,/?2] a J, (3.18) 
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Verifying Jacobi for (3.18) we find the p must also satisfy 

Pai [/?2,72] a = [p ai /32,72] a +[/?2,P ai 72] a , 

which is trivially satisfied if a is Abelian, but in general this condition states that p a 
is a derivation on o. 

Now consider the case where i _1 is a homomorphism and 

ker i~ = range r. 

Then the sequence 

„--i 



0:=^ a^^ fj ^^g:=^0 



is exact in both directions and, hence, both i and n = r 1 are bijections. The action 
of g on o is 

p a r] = i~ l [r a ,ir]\ = [r 1 ra,r ? ] Q = 

since by exactness i^ 1 or = 0. This is called a direct sum. Note that in this case the 
role of q and a is interchangeable and they are both ideals in f). The bracket in rj is 
easily obtained from (3.18) by letting p = 0, 

[a,P\ =([«i,/?i ] fl ,[a 2 ,ft] fl ). (3.19) 

Semidirect and direct extensions play an important role in physics. A simple example 
of a semidirect sum structure is when q is the Lie algebra so(3) associated with 
the rotation group SO (3) and a is M 3 . Their semidirect sum is the algebra of the 
six parameter Euclidean group of rotations and translations. This algebra can be 
used in a Lie-Poisson bracket to describe the dynamics of the heavy top (see for 
example [40,56,92]). We have already discussed the semidirect sum in Section 2.2.3. 
The bracket (2.13) is a semidirect sum, with q the algebra of the group of volume- 
preserving diffeomorphisms and a the Abelian Lie algebra of functions on M 2 . The 
action is just the adjoint action p a v := [ a , v ] obtained by identifying g and a. 
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In general, semidirect Lie-Poisson structures appear in systems where the 
field variables are in some sense "slaved" to the base variable (the one associated 
with f)) [57,90]. Here, the advected quantities are forced to move on the coadjoint 
orbits of the Lie group G. This is seen directly from the equations of motion (2.6), 
since, for a semidirect sum, 



f 



6 — £" 



t 



dQ <5H/<5£ S > 



which is by definition the infinitesimal generator of the coadjoint orbits of the Lie 
group [58] (see Section 2.1). For example, the coadjoint orbits of 50(3) are spheres, 
so the semidirect product 2 of SO (3) and M 3 leads to a physical system where the 
dynamics are confined to spheres, which naturally describes rigid body motion. In 
other words, the coadjoint orbits of the semidirect product of G and M 3 are isomorphic 
to the coadjoint orbits of G. We shall have more to say on this in Section 6.3.4. 

A Lie-Poisson bracket built from a direct sum is just a sum of the separate 
brackets. The dynamical interaction between the variables can only come from the 
Hamiltonian or from constitutive equations. For example, in the baroclinic instability 
model of two superimposed two fluid layers with different potential vorticities, the two 
layers are coupled through the potential vorticity relation [64]. A very similar model 
with a direct sum structure exists in MHD for studying magnetic reconnection [17]. 

3.4.1 Classification of Splitting Extensions 

We now briefly mention the connection between the first cohomology group and split- 
ting extensions. This will not be used directly in the classification scheme of Chap- 
ter 4, but we include it for completeness. We assume in this section that o is Abelian. 
In (3.17) we had chosen the canonical r, r(a) = (a, 0). Now suppose we use instead 

T f (a) = (a,v(a)). (3.20) 



2 Semidirect product is the term used for groups, semidirect sum for algebras. 
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Here v is a linear map from q to o and is thus an element of C 1 (g, a), a 1-cochain. 
If t' is a Lie algebra homomorphism, 

T'([a,[3] g ) = ([a,f3] g ,v([a,f3} g )) (3.21) 

must be equal to 

[T'(a),T'(f3)] t) =([a,P} s ,p a u(f3)-p fS u(a)) (3.22) 

subtracting (3.21) and (3.22) gives 

Pav(j5) - pfiv(a) -v([a,(3] g ) = sv(a,(3) = 0, (3.23) 

from (3.3). Hence v is a cocycle, with coboundaries given by 

v(a) = p a r] , 7] £ a, (3.24) 

The first cohomology group HJ>(q, a) classifies splitting extensions of f) by a modulo 
those given in terms of the coboundaries (3.24). 



Chapter 4 
Classification of Extensions of a Lie Algebra 



In this chapter we return to the main problem introduced in Section 2.3: the clas- 
sification of algebra extensions built by forming n-tuples of elements of a single Lie 
algebra 5. The elements of this Lie algebra \) are written as a := («i, . . . , a n ), on € 5, 
with a bracket defined by 

[a,(3] x = W x ^[a^p u ], (2.19) 

where W\^ u are constants. We will call n the order of the extension. Recall (see 
Section 2.3) that the W's are symmetric in their upper indices, 

W x ^ = W x vti , (2.20) 

and commute, 

w (y) W {t) = w {&) W rM ) (2.23) 

where the n x n matrices W^ u > are defined by [W^k := W\ u ^. Since the W's are 
3-tensors we can also represent their elements by matrices obtained by fixing the 
lower index, 

W W : [W w r-.= W X ^, (4.1) 

which are symmetric but do not commute. Either collection of matrices, (2.22) 
or (4.1), completely describes the Lie bracket, and which one we use will be under- 
stood by whether the parenthesized index is up or down. 
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What do we mean by a classification? A classification is achieved if we obtain 
a set of normal forms for the extensions which are independent, that is not related 
by linear transformations. We use linear transformations because they preserve the 
Lie-Poisson structure — they amount to transformations of the W tensor. We thus 
begin by assuming the most general W possible. 

We first show in Section 4.1 how an extension can be broken down into a 
direct sum of degenerate subblocks (degenerate in the sense that the eigenvalues 
have multiplicity greater than unity). The classification scheme is thus reduced to 
the study of a single degenerate subblock. In Section 4.2 we couch our particular 
extension problem in terms of the Lie algebra cohomology language of Section 3.2 
and apply the techniques therein. The limitations of this cohomology approach are 
investigated in Section 4.3, and we look at other coordinate transformations that 
do not necessarily preserve the extension structure of the algebra, as expressed in 
diagram (3.16). In Section 4.5 we introduce a particular type of extension, called the 
Leibniz extension, that is in a sense the "maximal" extension. Finally, in Section 4.6 
we give an explicit classification of solvable extensions up to order four. 

4.1 Direct Sum Structure 

A set of commuting matrices can be put into simultaneous block-diagonal form by a 
coordinate transformation, each block corresponding to a degenerate eigenvalue [89]. 
Let us denote the change of basis by a matrix Mp a , with inverse (M~ l ) _ , such that 
the matrix W^ v >, whose components are given by 

Wp Sv = {M- 1 ) l} X W x i»'M l * , 

is in block-diagonal form for all v [89]. However, If/" is a 3-tensor and so the third 
index is also subject to the coordinate change: 
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Figure 4.1: Schematic representation of the 3-tensor W for a direct sum of extensions. 
The cubes represent potentially nonzero elements. 

This last step only adds linear combinations of the W^'s together, so the W'W's 
and the W^'s have the same block-diagonal structure. Note that the Ws a ^ are 
still symmetric in their upper indices, since this property is preserved by a change of 
basis: 



wf-y 



^Wx^MfMS 



(M 



(Relabeling \i and v) 



(r^/^M^M/ 



So from now on we just assume that we are working in a basis where the W^ v > 's are 
block-diagonal and symmetric in their upper indices; this symmetry means that if we 
look at a W as a cube, then in the block-diagonal basis it consists of smaller cubes 
along the main diagonal. This is the 3-tensor equivalent of a block-diagonal matrix, 
as illustrated in Figure 4.1, a pictorial representation of a direct sum of extensions. 
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4.1.1 Example: three-field model of MHD 

We consider as an example of a direct sum structure a three-field model of MHD 
due to Hazeltine [30,32]. In addition to the vorticity to and the magnetic flux ip 
(see Section 2.2.3), the model also includes a field x which measures plasma density 
perturbations. The model includes as limits the RMHD system of Section 2.2.3 and 
the Charney-Hasegawa-Mima equation [41]. We thus have £ = (u,ip,x), with the 
Hamiltonian 



H = I(|V</>| 2 + |VVf + a X 2 



(4.2) 



and bracket represented by the matrices 

/l 0\ 

W (1) = 1 , w {2) = 






w^ 



The matrices commute and obey the symmetry (2.20), so they form a good bracket. 
As in Section 2.2.3, the electric potential is denoted by <f> and the electric current 
by J. The equations of motion are given by 

u = [u,(f>] + [ip,J], 

4 = [^,<t>} + a[x,^}, (4.3) 

The W^'s are not in block triangular form, and since W^ 3 ' has eigenvalues which are 
not threefold degenerate we know the extension can be blocked-up further. Indeed, 
the coordinate transformation rf" = ^ M v ^ , with 



M 






1 


1 





1 





(4.4) 



will transform the extension to 



W^ 



1 
1 













w^ 





1 













w^ 














1 



38 



where we have explicitly indicated the blocks. The extension is also block-diagonal 
in the alternate, lower-indexed representation, 



IF, 



(i) 



1 














IT, 



(2) 



1 

1 













IT 



(3) 














1 



This is what was meant by "cubes" at the end of the previous section. 

At the bracket level the variables rj 1 and n 2 are decoupled from rf. But under 
the transformation (4.4) the Hamiltonian (4.2) becomes 



H = \ ( \V(r] 1 + jf)\ 2 + |Vr/ 2 | 2 + a\ri 



1 1 2 



The new equations of motion are thus 

r l 1 =[v l ,4>] + [ri\J], 

V 2 = [v 2 , 4> - " V 1 ] , 

with V 2 (j) := 7] 1 + r/ 3 and J := V 2 r/ 2 . The variable ?? 3 is still coupled to the other 
variables through the defining relation for <p. 



4.1.2 Lower-triangular Structure 

Block-diagonalization is the first step in the classification: each subblock of W is 
associated with an ideal (hence, a subalgebra) in the full n-tuple algebra q. (A 
subset o C f) is an ideal in the Lie algebra f) if [f) , a] C a. Ideals are subalgebras.) 
Hence, by the definition of Section 3.4, the algebra g is a direct sum of the algebra 
denoted by each subblock. Each of these algebras can be studied independently, 
that is we can focus our attention on a single subblock. So from now on we assume 
that we have n commuting matrices, each with n-fold degenerate eigenvalues. The 
eigenvalues can, however, be different for each matrix. 

Such a set of commuting matrices can be put into lower-triangular form by 
a coordinate change, and again the transformation of the third index preserves this 
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structure (though it can change the eigenvalue of each matrix). The eigenvalue of 
each matrix lies on the diagonal; we denote the eigenvalue of W^> by A^' . We write 
the quantity W\^ u as the matrix 



Wi 



/A« 

AM 



(i) 



0\ 




\A( n ) ••• 0/ 

which consists of the first row of the lower-triangular matrices W^' as prescribed by 
(4.1). Evidently, the symmetry of Wn\ requires 

that is, all the matrices W^' are nilpotent (their eigenvalues vanish) except for W^ 1 ' 
when 9^0. If this first eigenvalue is nonzero then it can be scaled to 9 = 1 by 
the coordinate transformation M u a = 9~ l 5 v a . We will use the symbol 9 to mean a 
variable which can take the value or 1. Figure 4.2 shows the structure, with 9 = 0, of 
a degenerate extension, after lower-triangularity and symmetry of the upper indices 
of W are taken into account. 

4.2 Connection to Cohomology 

We now bring together the abstract notions of Chapter 3 with the n-tuple extensions 
of Section 2.3. It is shown in Section 4.2.1 that we need only classify the case 
of 9 = 0. This case will be seen to correspond to solvable extensions, which we 
classify in Section 4.2.2. 



4.2.1 Preliminary Splitting 

Assume we are in the basis described at the end of Section 4.1 and, for now, sup- 
pose 9 = 1. To place the structure of W in the context of Lie algebras, we first give 
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Figure 4.2: Schematic representation of the 3-tensor W for a solvable extension. The 
cubes represent potentially nonzero elements. The vertical axis is the lower index A 
of W\^ u , and the two horizontal axes are the symmetric upper indices /x and v. 
The origin is at the top-rear. The pyramid-like structure is a consequence of the 
symmetry of W and of its lower-triangular structure in this basis. 
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some definitions. The derived series Q^ k ' of g has terms 



(O) = 
n(i)-r, 



,(2) 



(1) ,0 (1) 



,(fc) 



j( fc-l) >fl (fc-l) 



(4.5) 



where by [0,0] we mean the set obtained by taking all the possible Lie brackets of 
elements of g. The lower central series g h has terms defined by 



0° = 


= 




s 1 - 


= [0 


0] 


2 = 


= [* 


,0 X 



,(*-!) 



(4.6) 



An algebra g is said to be solvable if its derived series terminates, Q^ k ' = 0, for some k. 
An algebra g is said to be nilpotent if its lower central series terminates, g = 0, for 
some k. Note that a nilpotent algebra is solvable, but not vice- versa [44]. 

The set of elements of the form (3 = (0, /%, • • • , (3 n ) is a nilpotent ideal in f) 
that we denote by a (o is thus a solvable subalgebra). To see this, observe that (4.6) 
involves nested brackets, so that the elements a of the lower central series will 
involve kth powers of the W^' . But since the W^' with \x > 1 are lower-triangular 
with zeros along the diagonal, we have (W^>) n ~ l = 0, and the lower central series 
must eventually vanish. 

Because o is an ideal, we can construct the algebra g = f)/a, so that f) is an 
extension of g by a. If g is semisimple, then a is the radical of f) (the maximal solvable 
ideal). It is easy to see that the elements of g embedded in I) are of the form a = 
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(ai,0, . . . , 0). We will now show that f) splits; that is, there exist coordinates in 
which f) is manifestly the semidirect sum of g and the (in general non-Abelian) 
algebra o. 

In Appendix B we give a lower-triangular coordinate transformation that 
makes W^ l > = /, the identity matrix. Assuming we have effected this transformation, 
the mappings i, n, and r of Section 3.2 are given by 

i : a — ► f), i(a 2 , . . . , a n ) = (0, a 2 , ■ ■ ■ , a n ), 
7r : f) — > Q, 7v(a 1 ,a 2 ,...,a n ) = a t , 
t : g — ► f), t(qi) = (ai,0, ...,0), 

and the cocycle of Eq. (3.9) is 

iu{a,(3) = [Ta,T/3] t) -T[a,/3] g 

= [(Oi, 0,...,0),(y9i,0,...,0) ](,-([«!,/?! ],0,...,0) 

= (^ n [ai,/3i],0,...,0) -([ai,/?i],0,...,0) 
= 0, 

since Wi = 1. Hence, the extension is a semidirect sum. The coordinate transfor- 
mation that made W^- 1 ' = I removed a coboundary, making the above cocycle vanish 
identically. For the case where g is finite-dimensional and semisimple, we have an ex- 
plicit demonstration of the Levi decomposition theorem: any finite-dimensional 1 Lie 
algebra rj (of characteristic zero) with radical a is the semidirect sum of a semisimple 
Lie algebra g and o [44] . 

4.2.2 Solvable Extensions 

Above we assumed the eigenvalue 6 of the first matrix was unity; however, if this 
eigenvalue vanishes, then we have a solvable algebra of n-tuples to begin with. Since n 
is arbitrary we can study these two solvable cases together. 



The inner bracket can be infinite dimensional, but the order of the extension is finite. 
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Thus, we now suppose t) is a solvable Lie algebra of n-tuples (we reuse the 
symbols f), 0, and o to parallel the notation of Section 3.1), where all of the the W^w's 
are lower-triangular with zeros along the diagonal. Note that W^ n > = 0, so the set 
of elements of the form a = (0, . . . , 0, a n ) forms an Abelian subalgebra of f). In fact, 
this subalgebra is an ideal. Now assume () contains an Abelian ideal of order n — m 
(the order of this ideal is at least 1), which we denote by a. The elements of a can 
always be cast in the form 

a = (0, ... , 0, a m+ i,... ,a n ) 

via a coordinate transformation that preserves the lower-triangular, nilpotent form 
of the W("). 

We also denote by g the algebra of m-tuples with bracket 

■m 
[(ai,..., a m ), (/3i,..., Pm)] sx = E W x ^ [a M ,A,] , A = l,...,m. 

It is trivial to show that q = rj/a, so that t) is an extension of by o. Since a 
is Abelian we can use the formalism of Section 3.1 (the other case we used above 
was for a non- Abelian but where the extension was semidirect). The injection and 
projection maps are given by 



a, j • 



i : a — ► f), i(a m+ i,... ,a n ) = (0, . . . ,0,a m+ i, . . . ,a 
7T : i) — > 0, 7r(«i, a 2 , ■■■ ,a n ) = («i, . . . , a m ), 

T '■ > f); r («l, ■■■ , «m) = (ai, ■ ■ ■ , CXm, 0, . . . , 0). 

From the definition of the action, Eq. (3.7), we have for a £ q and 77 G 0, 
ip a r]= [ra,ir]\ 

= [ (ai , . . . , dm, 0, . . . , 0) , (0, . . . , 0, 7] m+1 , . . . , rj n ) ] h 

m n—1 

= E E (0,---,0,W^ 2 [a fl ,7 lu ],...,W n ^[a fl ,7 lu }). 

/i=l u=m+l 



(4.7) 
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In addition to the action, the solvable extension is also characterized by the cocycle 
defined in Eq. (3.9), 

iu(a,/3) = [Ta,T(3] i) -T[a,(3] g 

= [(ai,..., a m ,0, ... ,0),(/3i,...,/3 m ,0,...,0)]j ) 

-r[(ai,...,a m ), (/3i,..., /3 m )] 

m 

= ^(0,...,0,C + i[« )1 ,^] 1 -.,^' , > Ml ^]). (4.8) 

fJ,,U=l 

We can illustrate which parts of the W's contribute to the action and which to the 
cocycle by writing 



*«--£■ 



-jj*- , A = m+l,...,n, (49) 



where the w^'s are m x m symmetric matrices that determine the cocycle uj and 
the r^'s are m x (n — m) matrices that determine the action p. The zero matrix of 
size (n — m) x (n — m) on the bottom right of the Wr\) ' s appears as a consequence 
of o being Abelian. 

The algebra g is completely characterized by the W/\\, A = 1, . . . , m. Hence, 
we can look for the maximal Abelian ideal of g and repeat the procedure we used for 
the full t). It is straightforward to show that although coordinate transformations 
of g might change the cocycle u and the action p, they will not alter the form of (4.9). 

Recall that in Section 3.1 we defined 2-coboundaries as 2-cocycles obtained 
from 1-cochains by the coboundary operator, s. The 2-coboundaries turned out to 
be removable obstructions to a semidirect sum structure. Here the coboundaries are 
associated with the parts of the Wt\\ that can be removed by (a restricted class of) 
coordinate transformations, as shown below. 

Let us explore the connection between 1-cochains and coboundaries in the 
present context. Since a 1-cochain is just a linear mapping from g to a, for a = [a\ , . . . , a m ) G g 
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we can write this as 



,(i), 



a) = -^A^ A a A , fj, = m + l,...,n, 



(4.10) 



A=l 



where the k^ are arbitrary constants. To find the form of a 2-coboundary we act on 
the 1-cochain (4.10) with the coboundary operator; using (3.3) and (4.7) we obtain 



u;l ob (a,(3) = ( S u;^)(a,(3) 



(i)> 



Pa ^ 1 )(/3)+ / > /3 ^ 1 )(a)-J 1 )([a,/3] ) 

m n 



m n 



E E w» 



/«/ 



<fy , w^ 1 - 1 



fj.=l v=m+l 



[l,V, <J = l 



(4.11) 



After inserting (4.10) into (4.11) and relabeling, we obtain the general form of a 
2-coboundary 



«,f*>(a,l3)= Y Va^K,/3,], A = m + 1,..., 



n, 



where 



W- := J^ fc A T W^" - Yl (k^W x ua + k a u W x ^). 



(4.12) 



T=l 



<r=m+l 



To see how coboundaries are removed, consider the lower-triangular coordi- 
nate transformation 



[Mj 







cl 



where a labels rows. This transformation subtracts V/\\ from W/\\ for X> m and 
leaves the first m of the Wi\\ 's unchanged. In other words, if W is the transformed W, 



(W, 



w, 



(A) 



(A) 



c -i WA _ v A ) 



r A 



A = 1, . . . , to; 
X = m + 1, . . . ,n. 



(4.13) 
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We have also included in this transformation an arbitrary scale factor c. Since by (4.8) 
the block in the upper-left characterizes the cocycle, we see that the transformed co- 
cycle is the cocycle characterized by w\ minus the coboundary characterized by V^. 

The special case we will encounter most often is when the maximal Abelian 
ideal of \) simply consists of elements of the form (0, . . . , 0, a n ). For this case m = 
n—1, and the action vanishes since W n ^ n = (the extension is central). The cocycle u 
is entirely determined by Wi n \. The form of the coboundary is reduced to 

n— 1 

V r r = Y,k n T W T ^, (4.14) 

T=l 

that is, a linear combinations of the first (n — 1) matrices. Thus it is easy to see 
at a glance which parts of the cocycle characterized W( n \ can be removed by lower- 
triangular coordinate transformations. 

4.3 Further Coordinate Transformations 

In the previous section we restricted ourselves to lower-triangular coordinate trans- 
formations, which in general preserve the lower-triangular structure of the W^' . But 
when the W^' matrices are relatively sparse, there exist non-lower-triangular coor- 
dinate transformations that nonetheless preserve the lower-triangular structure. As 
alluded to in Section 3.3, these transformations are outside the scope of cohomology 
theory, which is restricted to transformations that preserve the exact form of the 
action and the algebras g and a, as shown by (4.13). In other words, cohomology 
theory classifies extensions given q, o, and p. We need not obey this restriction. We 
can allow non-lower-triangular coordinate transformations as long as they preserve 
the lower-triangular structure of the W^'^s. 

We now discuss a particular class of such transformations that will be useful 
in Section 4.6. Consider the case where both the algebra of (n — l)-tuples and that 
of 1-tuples a are Abelian. Then the possible (solvable) extensions, in lower triangular 
form, are characterized by Wrx) =0, A = l,...,n— 1, with W( n ) arbitrary (except 
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for W n ^ n = 0). Let us apply a coordinate change of the form 



M 









where m is an (n — 1) x (n — 1) nonsingular matrix and c is again a nonzero scale 
factor. Denoting by W the transformed W, we have 

'0 A = l,...,n-1; 



^(A) 



c * m T w> m 







\ (4-15) 

X = n. 







This transformation does not change the lower-triangular form of the exten- 
sion, even if m is not lower-triangular. The manner in which w n is transformed 
by M is very similar to that of a (possibly singular) metric tensor: it can be diago- 
nalized and rescaled such that all its eigenvalues are or ±1. We can also change the 
overall sign of the eigenvalues using c (something that cannot be done for a metric 
tensor). Hence, we shall order the eigenvalues such that the +l's come first, followed 
by the — l's, and finally by the 0's. We will show in Section 4.6 how the negative 
eigenvalues can be eliminated to harmonize the notation. 

4.4 Appending a Semisimple Part 

In Section 4.2 we showed that because of the Levi decomposition theorem we only 
needed to classify the solvable part of the extension for a given degenerate block. 
Most physical applications have a semisimple part (0 = 1); when this is so, we 
shall label the matrices by VF^ ', W^\ . . . , W^ n > , where they are now of size n + 1 
and W^ ' is the identity. 2 Thus the matrices labeled by VF' 1 ^, . . . , W^ n ' will always 
form a solvable subalgebra. This explains the labeling in Sections 2.3.1 and 2.3.2. 

If the extension has a semisimple part (9 = 1, or equivalently W^°' = /), we 
shall refer to it as semidirect. This was the case treated in Section 4.2.1. A pic- 
torial representation of an arbitrary semidirect extension with nonvanishing cocycle 



2 The term semisimple is not quite precise: if the base algebra is not semisimple then neither is 
the extension. However we will use the term to distinguish the different cases. 
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Figure 4.3: Front and rear views of a schematic representation of the 3-tensor W 
for an arbitrary semidirect extension with cocycle. The solvable part is in red. The 
semisimple part is in blue and consists of unit entries. The axes are as in Figure 4.2. 
An extension with all these elements nonzero cannot actually occur. 



is shown in Figure 4.3. If the extension is not semidirect, then it is solvable (and 
contains n matrices instead of n + 1). This is the extension represented in Figure 4.2. 

Given a solvable algebra of n-tuples we can carry out in some sense the inverse 
of the Levi decomposition and append a semisimple part to the extension. Effectively, 
this means that the n x n matrices W^~> ,..., W^ n ' are made n + lxn + lby adding 
a row and column of zeros. Then we simply append the matrix W^> = I to the 
extension. In this manner we construct a semisimple extension from a solvable one. 
This is useful since we will be classifying solvable extensions, and afterwards we will 
want to recover their semidirect counterpart. 

The extension obtained by appending a semisimple part to the completely 
Abelian algebra of n-tuples will be called pure semidirect. It is characterized by W^> = 
I, and W\L IU = for /j,, v > 0. This is shown schematically in Figure 4.4. 
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Figure 4.4: Schematic representation of the 3-tensor W for a pure semidirect exten- 
sion. The axes are as in Figure 4.2. 



4.5 Leibniz Extension 

A particular extension that we shall consider is called the Leibniz extension [81]. For 
the solvable case this extension has the form 



W {1) =: TV 



1 
1 

V 



\ 



i oy 



(4.16) 



or W\ 



W^ 1 



}\-r 



l , A > 1. The first matrix is an n x n Jordan block. In this case the 



other matrices, in order to commute with W^' , must be in striped lower-triangular 
form [89], 



W {u) 



(o 






\ 


a 









b 


a 







c 


b 


a 




d 


c 


b a 





(4.17) 



V 
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Figure 4.5: Two views of the 3-tensor W for a solvable Leibniz extension, where each 
cube denotes a 1. The axes are as in Figure 4.2. The Leibniz extension is "hollow." 

But by symmetry of the upper indices the first column of matrix W^ u > must be W\ ^ v ' = 5\ u , 
so that 



w (v) = ( N y f 



(4.18) 



where on the right-hand side the v denotes an exponent, not a superscript. An 
equivalent way of characterizing the Leibniz extension is 



W > T = 8 X '* V , ii,v,\ = l,...,n. 



(4.19) 



The tensor 5 is an ordinary Kronecker delta. Note that neither (4.18) nor (4.19) 
are covariant expressions, reflecting the coordinate-dependent nature of the Leibniz 
extension. 

The Leibniz extension is in some sense a "maximal" extension: it is the only 
extension that has Wt\\ ^ for all A > 1 (up to coordinate transformations). Its 
uniqueness will become clear in Section 4.6, and is proved in Section 4.7. We show 
two schematic views of the extension in Figure 4.5. Fans of 1980's arcade games 
will understand why the author is suggesting the alternate name Q*Bert extension, 3 



3 n* 



Q*Bert is a trademark of the Sony Corporation. 
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Figure 4.6: Screenshot of the Q*Bert game. Compare with Figure 4.5! 

since Leibniz has no dearth of things named after him (see Figure 4.6). 

To construct the semidirect Leibniz extension, we append W^ ' = /, a square 
matrix of size n + 1, to the solvable Leibniz extension above, as described in Sec- 
tion 4.4. The characterization given by Eq. (4.19) can be used for the semidirect 
Leibniz extension by simply letting the indices run from to n. 

4.6 Low-order Extensions 

We now classify algebra extensions of low order. As demonstrated in Section 4.2 we 
only need to classify solvable algebras, which means that W^ n ' = for all cases. We 
will do the classification up to order n = 4. For each case we first write down the most 
general set of lower-triangular matrices W^ u ' (we have already used the fact that a 
set of commuting matrices can be lower-triangularized) with the symmetry W\^ v = 
W\ Ufl built in. Then we look at what sort of restrictions the commutativity of the 
matrices places on the elements. Finally, we eliminate coboundaries for each case by 
the methods of Sections 4.2 and 4.3. This requires coordinate transformations, but 
we usually will not bother using new symbols and just assume the transformation 
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were effected. 

Note that, due to the lower-triangular structure of the extensions, the clas- 
sification found for an m-tuple algebra applies to the first m elements of an n-tuple 
algebra, n> m. Thus, Wi n \ is the cocycle that contains all of the new information 
not included in the previous m = n — 1 classification. These comments will become 
clearer as we proceed. 

There are three generic cases that we will encounter for any order: 

1. The Leibniz extension, discussed in Section 4.5. 

2. An extension with Wr\\ =0, A = 1, . . . ,n — 1, and Wr n \ arbitrary (and sym- 
metric). This extension automatically satisfies the commutativity requirement, 
because the product of any two W^' vanishes. It can be further classified by 
the methods of Section 4.3. Later we will refer to this case as having a vanishing 
coextension (see Section 5.4 and Figure 6.3). 

3. The Abelian extension, which vanishes identically: Wt\\ = 0, A = 1, . . . , n. This 
is a special case of 2, above. When appended to a semidirect part (as explained 
in Section 4.4), the Abelian extension generates the pure semidirect extension. 

We shall call an order n extension trivial if Wt n ) = 0, so that the cocycle 
appended to the order n — 1 extension contributes nothing to the bracket. 

We now proceed with the classification for orders n = 1 to 4. 

4.6.1 n=l 

This case is Abelian, with the only possible element W\ = 0. 

4.6.2 n=2 



The most general lower-triangular form for the matrices is 

w 2 u o) ' Uo 
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If W2 7^ 0, then we can rescale it to unity. Hence, we let W2 '■= 0i, where 6\ = 
or 1. The case 6± = is the Abelian case, while for = 1 we have the n = 2 Leibniz 
extension (Section 4.5). Thus for n = 2 there are only two possible algebras. The 
cocycle which we have added at this stage is characterized by 6\. 

4.6.3 n=3 

Using the result of Section 4.6.2, the most general lower-triangular form is 

/ 0\ 

W (1) = I 0i , w {2) = 












0i 








v, 11 


w^ 
























w^ 


w 3 22 






and W^ 3 ' = 0. These satisfy the symmetry condition (2.20), and the requirement 
that the matrices commute leads to the condition 



e l wv = 



The symmetric matrix representing the cocycle is 

'W3 11 W 3 21 

'(3) 



W (3) = W 3 21 W 3 22 . (4.20) 



0; 

If 01 = 1, then W 3 must vanish. Then, by (4.14) we can remove from Wr 3 \ a 

multiple of Wq), and therefore we may assume W 3 vanishes. A suitable rescaling 

allows us to write W 3 = 02, where 02 = or 1. The cocycle for the case 0i = 1 is 

thus 

/0 02 0^ 
W {3) =0 2 

\0 Oy 

For 02 = 1 we have the Leibniz extension (Section 4.5). 

If 0i = 0, we have the case discussed in Section 4.3. For this case we can 
diagonalize and rescale Wt 3 ) such that 



W {3) 
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where (Ai,A2) can be (1,1), (1,0), (0,0), or (1,-1). This last case, as alluded to 
at the end of Section 4.3, can be transformed so that it corresponds to 0\ = 0, 
02 = 1. The choice (1,0) can be transformed to the 0\ = 1, 02 = case. Finally 
for (Ai, A2) = (1, 1) we can use the complex transformation 

to transform to the 0\ = 0, 02 = 1 case. 

We allow complex transformations in our classification because we are chiefly 
interested in finding Casimir invariants for Lie-Poisson brackets. If we disallowed 
complex transformations, the final classification would contain a few more members. 
The use of complex transformations will be noted as we proceed. 

There are thus four independent extensions for n = 3, corresponding to 

(0i,0 2 )e{(O,O),(O,l),(l,O),(l,l)}. 

These will be referred to as Cases 1-4, respectively. Cases 1 and 3 have 02 = 0, and 
so are trivial (Wt^\ = 0). Case 2 is the solvable part of the compressible reduced 
MHD bracket (Section 2.3.2). Case 4 is the solvable Leibniz extension. 

4.6.4 n=4 

Proceeding as before and using the result of Sections 4.6.2 and 4.6.3, we now know 
that we need only write 



W {4) 



fW 4 n W 4 21 W 4 31 0\ 

W 4 21 W 4 22 W 4 32 

w 4 31 w 4 32 w 4 33 

\ 0/ 



(4.21) 



The matrices Wn\, Wm, an d 1^(3) are given by their n = 3 analogues padded with an 
extra row and column of zeros (owing to the lower-triangular form of the matrices) . 
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The requirement that the matrices V^' 1 ) . . . W^> commute leads to the conditions 



9 2 W4 33 = 0, 



? 2 W 4 31 = 9 1 W 4 22 , 
9 2 W4 32 = 0, 



?! VF4 32 = 0. 



(4.22) 



There are four cases to look at, corresponding to the possible values of 6\ and # 2 . 

Case 1 6>i = 0, G 2 = 0. 

This is the unconstrained case discussed in Section 4.3, that is, all the com- 
mutation relations (4.22) are automatically satisfied. We can diagonalize to give 



W, 



(i) 



/Ai 0\ 

A' 2 

X' 3 

\0 0/ 



where 



(Ai, A 2 , A' 3 ) G {(1, 1, 1), (1, 1,0), (1,0,0), (0,0,0), (1, 1, -1), (1, -1,0)} , 

so there are six distinct cases. The exact form of the transformation is unimportant, 
but the (1,1,0) extension can be mapped to Case 2 (the transformation is com- 
plex), (1,0,0) can be mapped to Case 3a, and (1,-1,0) can be mapped to Case 2. 
Finally the (1,1,1) extension can be mapped to the (1,1,-1) case by a complex 
transformation. 

After transforming that (1, 1, —1) case, we are left with 



W, 



(-1) 



/o o\ /o 1 0\ 

10 

' 1 

\o 0/ \o 0/ 



These will be called Cases la and lb. 



Case 2 9 1 = 0,0 2 = 1. 
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The commutation relations (4.22) reduce to Wa 



31 



Wa 32 



W A 33 = 0, and 



we have 



W, 



( w \ 


w 4 21 





o\ 


w 4 21 


Wa 22 




















V 








0/ 



(1) 



We can remove W\ because it is a coboundary (in this case a multiple of W($\). 
We can also rescale appropriately to obtain the four possible extensions 



W, 



/o 0\ 




/l 0\ 




/l 0\ 




/l 0\ 












10 




0-100 





1 





1 





•} 





\0 0/ 




\o 0/ 




\0 0/ 




\o 0/ 



(1) 



Again, the form of the transformation is unimportant, but it turns out that the 
second extension can be mapped to Case 3c, and the third and fourth to Case 3b. 
This last transformation is complex. Thus there is only one independent possibility, 



the trivial extension Wi 



(i) 



0. 



Case 3 6»i = 1,0 2 = 0. 



We can remove W4 using a coordinate transformation. From the commu- 



tatioi 


1 requirement (4.22) we ob1 


:ain W 4 22 = W 4 32 = 0. 1 


and 






/ Wa 21 Wa 31 0\ 




w (4) = 


Wa 21 
W A 31 W A 33 

\ 00/ 



0. We are left with W, 



(3) 







Using the fact that elements of the form (0,02,0,04) are an Abelian ideal of this 
bracket, we find that W4 W4 = 0. Using an upper-triangular transformation we 



2ii 



can also make W 4 W 4 
the trivial extension Wi 



31 



(i) 



0. After suitable rescaling we find there are five cases: 
0, and 



/o o\ /o 1 0\ 



w, 



(1) 




10 

\o 0/ 




10 

\o 0/ 



/o 1 0\ 

10 

10 

\o 0/ 



/o 1 0\ 

10 



\o 0/ 
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However the last of these may be mapped to Case 4 (below) with #3 = 0. We will 
refer to the trivial extension as Case 3a and to the remaining three extensions as 
Cases 3b-d, respectively. 

Case 4 0i = 1, 2 = 1- 

The elements W4 and W4 are coboundaries that can be removed by a 
coordinate transformation. From (4.22) we have H4 33 = W4 32 = 0, W4 22 = W4 31 =: 
03, so that 



W {4) 



/0 3 0\ 

3 

3 

Vo 0/ 



For 03 = 1 we have the Leibniz extension. The two cases will be referred to as 
Case 4a for 3 = and 4b for 3 = 1. 

Table 4.1 summarizes the results. There are are total of nine independent n = 
4 extensions, four of which are trivial {Wu\ =0). As noted in Section 4.5 only the 
Leibniz extension, Case 4b, has nonvanishing Wu\ for all 1 < % < n. 

The surprising fact is that even to order four the normal forms of the ex- 
tensions involve no free parameters: all entries in the coefficients of the bracket are 
either zero or one. There is no obvious reason this should hold true if we try to 
classify extensions of order n > 4. It would be interesting to find out, but the clas- 
sification scheme used here becomes prohibitive at such high order. The problem is 
that some of the transformations used to relate extensions cannot be systematically 
derived and were obtained by educated guessing. 

4.7 Leibniz as the Maximal Extension 

We mentioned in Section 4.5 that the Leibniz extension is maximal: it is the only 
extension that has Wr\\ 7^ for all A > 1. Having seen the classification process at 
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Case W(2) W( 3 ) Wu\ 



(0) 



(o) 



» 



» 








1 

1 








1 

1 




Table 4.1: Enumeration of the independent extensions up to n = 4. We have Wry\ = 
for all the cases, and we have left out a row and a column of zeros at the end of each 
matrix. 
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work in Section 4.6, we are now in a position to show why the Leibniz extension has 
this property. We will demonstrate that the only way to extend a Leibniz extension 
nontrivially (i.e., with a nonvanishing cocycle) is to append a cocycle such that the 
new extension is again Leibniz. 

Consider a solvable Leibniz extension of order n — 1, denoted by the 3- 
tensor W. We increase the order of W by one by appending the most general cocycle 
possible (as was done in Section 4.6) to obtain an extension of order n denoted by 
the tensor W. The form of the matrices W^> of the new extension is 



lyW 



w n v> 







n 



1, 



,n 



1. 



(4.23) 



and W^ n > = 0. The quantity W n ^> is a row vector defined in the obvious manner 
as \W n M]" = WrT- 

In particular, the first matrix of the nth order extension is 



/ o 
1 



W™ 





1 



1 







l,n-l 



(4.24) 



0/ 



V Wn 11 W n 12 ■ ■ ■ Wn 1 '"- 2 W n 

where the W^" represent the appended cocycle, and we have explicitly delimited the 
order n— 1 Leibniz extension. It is not difficult to show that the W n ly , v = 1, . . . , n — 2, 
are coboundaries and so can be removed by a coordinate transformation. We thus 
assume that W n v = 0, u = 1, . . . ,n — 2. The only potentially nonzero element of 
that row is Wn 1 ' 71 ' 1 . 

Taking the commutator of two matrices of the form (4.23) gives the conditions 

n— 1 n— 1 

Y J W n ^W a ,/T = Y J Wn Ua W^ T , fi,v,T = l,...,n-l. 

CT=1 cr=l 

Substituting the form of the Leibniz extension (4.19) for W, this becomes 

n— 1 n— 1 



(7=1 



(7 = 1 
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or 

W n "' u+T = 
where v + r < n. For r = 1, this is 

W»' u+1 = 



\i + r > n, 



'W,^ +1 ' V n<n-l, 
jU = n — 1, 



for z/ = 1, . . . , n — 2, which says that W( n ) has a banded structure. Because we have 



that W 7 



\v 



0, i/ = ] 


,...,n 


— 2, it must 


3e t' 


rat 










^ ° 










Wn 1 '"- 1 


o\ 












Wn 1 '"- 1 










^(n) = 





Wn 1 '"' 1 

















W r n 1 ' B - 


1 

















^ o 















0/ 



So either W„ ,n =0 (the extension is trivial), or W n ' n can be rescaled to unity 
(the extension is of the Leibniz type). 

Thus, if one has a Leibniz extension of size n — 1 then the only way to 
nontrivially extend it is to make it the Leibniz extension of size n. But since the 
Leibniz extension is the only nontrivial extension of order 2 (see Section 4.6.2), we 
have shown the uniqueness of the maximal extension, up to a change of coordinates. 



Chapter 5 



Casimir Invariants for Extensions 



In this chapter we will use the bracket extensions of Chapter 4 to make Lie-Poisson 
brackets, following the prescription of Chapter 2. In Section 5.1 we write down the 
general form of the Casimir condition (the condition under which a functional is a 
Casimir invariant) for a general class of inner brackets. Then in Section 5.2 we see how 
the Casimirs separate for a direct sum of algebras, the case discussed in Section 4.1. 
Section 5.3 discusses the particular properties of Casimirs of solvable extensions. In 
Section 5.4 we give a general solution to the Casimir problem and introduce the 
concept of coextension. Finally, in Section 5.5 we work out the Casimir invariants 
for some specific examples, including CRMHD and the Leibniz extension. 

5.1 Casimir Condition 

A generalized Casimir invariant (or Casimir for short) is a function C : q* — > R for 
which 

{F,C} = 0, 

for all F : q* — ► R. Using (2.1) and (2.5), we can write this as 



5_F_ 5C_ 
Since this vanishes for all F we conclude 



6£ 

1£ 



5F_ 



5£ ' ? 



0. 



(5.1) 



61 



02 



To figure out the coadjoint bracket corresponding to (2.19), we write 

<£, [a, /3]) = (^ A ,^ A ^[a M , A,]), 
which after using the coadjoint bracket in g becomes 

so that 



[/3,£ 



it" 



/^,£ A 



(5.2) 



We can now write the Casimir condition (5.1) for the bracket extension as 



W x 



\xv 






0, v = 0, . . . ,n. 



(5.3) 



We now specialize the bracket to the case of most interested to us, where 
the inner bracket is of canonical form (2.11). (We will touch briefly on the finite- 
dimensional case in Section 5.1.1, but the remainder of the thesis will deal with a 
canonical inner bracket unless otherwise noted.) As we saw in Chapter 2, this is 
the bracket for 2-D fluid flows. Further, we assume that the form of the Casimir 
invariants is 



C[£] 



C(£(x))d 2 x, 



(5.4) 



and thus, since C does not contain derivatives of £, functional derivatives of C can 
be written as ordinary partial derivatives of C. We can then rewrite (5.3) as 

d 2 c 



w^ v 



d&dt a 



r,r 



0, v = 0, . . . ,n. 



(5.5) 



In the canonical case where the inner bracket is like (2.11) the [£ CT , £ A ] are indepen- 
dent and antisymmetric in A and a. Thus a necessary and sufficient condition for 
the Casimir condition to be satisfied is 



, d 2 c 
d£i*d€ o 



w. 



[iv 



d 2 c 



(5.6) 
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for A, a, v = 0, . . . , n. Sometimes we shall abbreviate this as 

W X ^C^ = W^C^x , (5.7) 

that is, any subscript /ionC following a comma indicates differentiation with respect 
to £ M . Equation (5.7) is trivially satisfied when C is a linear function of the £'s. That 
solution usually follows from special cases of more general solutions, and we shall 
only mention it in Section 5.4.2 where it is the only solution. 

An important result is immediate from (5.7) for a semidirect extension. 
Whenever the extension is semidirect we shall label the variables ^°,^ 1 , . . . ,£ n , be- 
cause the subset ^ 1 ,...,^ n then forms a solvable subalgebra (see Section 4.4 for 
terminology). For a semidirect extension, W^ ' is the identity matrix, and thus (5.7) 
gives 

h^C^ a = 5 a ^C^ x , 

which is satisfied because we can interchange the order of differentiation. Hence, 
v = does not lead to any conditions on the Casimir. However, the variables fi, A, a 
still take values from to n in (5.7). 

5.1.1 Finite-dimensional Casimirs 

For completeness, we briefly outline the derivation of condition (5.7) for a finite- 
dimensional algebra, though we shall be concerned with the canonical inner bracket 
for the remainder of the thesis. The Lie-Poisson bracket can be written 

{/.fll^^^.^H, (5.8) 

where the A are the structure constants of the algebra q. The roman indices denote 
the components of each £ M , in the same manner as the rigid body example of Sec- 
tion 2.2.1, and / and g are ordinary functions of the (?. The Casimir condition (5.3) 
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is thus 

BC 
W>T 0*^^ = 0, (5.9) 

where both f and j are free indices. From the structure constants we can construct 
the Cartan-Killing form [29,44], 

%:=4<&- (5-10) 

The Cartan-Killing form is symmetric, and is nondegenerate for a semisimple alge- 
bra. We assume this is the case for g, and denote the inverse of &ij by & lJ . 

For definiteness we take a Casimir of the form 

c = \^c,AHh ( 5 - n ) 

where C^ is a symmetric tensor. Inserting this into (5.9), we get 

W x >»'<% j X 9 C IMT %& = Q. (5.12) 

The symbol rf := & sl c^ ■ can be shown to be antisymmetric in its upper indices. (We 
use the Cartan-Killing form as a metric to raise and lower indices.) We can then 
define the bracket [ , ]* : g* x g* — ► g* by 

[t'VYk :=c k &VJ, (5-13) 

which is a Lie bracket on g* induced by the Cartan-Killing form R. The Casimir 
condition (5.12) can be rewritten neatly in terms of the bracket [ , ]* as 



wrc^ 



i\i x =0. (5.14) 



This should be compared with condition (5.5), for the infinite-dimensional case, 
where the bracket [ , ]* is obtained from the identification of g and g* . The Casimir (5.11) 
is thus the finite-dimensional analogue of (5.4). Since condition (5.12) has to be true 
for any value of the £, it follows that we must have 

W x ^C IMr = W a ^C llX , (5.15) 



65 



the same condition as (5.7). We conclude that, even thought we shall be concerned 
with the canonical bracket case, many of the subsequent results of this chapter apply 
to finite-dimensional brackets. 

5.2 Direct Sum 

For the direct sum we found in Section 4.1 that if we look at the 3-tensor W as a cube, 
then it "blocks out" into smaller cubes, or subblocks, along its main diagonal, each 
subblock representing a subalgebra. We denote each subblock of W\^ v by Wi\ 



i = 1, . . . , r, where r is the number of subblocks. We can rewrite (2.1) as 



{A,B} = Y,Ui,W 

1=1 \ 



ix^ 



SA 5B 



where i labels the different subblocks and the greek indices run over the size of the ith 

subblock. Each of the subbrackets { , } i depends on different fields. In particular, if 

the functional C is a Casimir, then, for any functional F 
r 

{F,C} = Y,{F,C} t = => {F,C} 1 = 0, i = l,...,r. 
i=i 

The solution for this is 

C[£] = Ci[£i] + -" + C r [£r], where {F,Q}. = 0, i = l,...,r, 

that is, the Casimir is just the sum of the Casimir for each subbracket. Hence, the 
question of finding the Casimirs can be treated separately for each component of the 
direct sum. We thus assume we are working on a single degenerate subblock, as we 
did for the classification in Chapter 4, and henceforth we drop the subscript i. 

There is a complication when a single (degenerate) subblock has more that 
one simultaneous eigenvector. By this we mean k vectors vS a \ a = 1, . . . ,k, such 
that 

Wa"Mi*W = AM u[ a) . 
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Note that lower-triangular matrices always have at least the eigenvector given by u^ = 5^ 
Let 7]( a ' := Up £ p , and consider a function C(rj^ , . . . ,7/"). Then 



d C „ r „/,a sr^ (a) (6) ^ ^ 



w ^ ) 7m^ = W ^ U) Y.^'" 



a, 6=1 



AM ^ u ( , a) u , 



a,6=l 



"A <*ff 



dnW&nW 



Because the eigenvalue AM does not depend on a (the block was assumed to have 
degenerate eigenvalues), the above expression is symmetric in A and a. Hence, the 
Casimir condition (5.6) is satisfied. 

The reason this is introduced here is that if a degenerate block splits into 
a direct sum, then it will have several simultaneous eigenvectors. The Casimir in- 
variants C^ a '{j]^ a ') and C^ '(r/' ') corresponding to each eigenvector, instead of adding 
as C( a '(r]( a >) + C' '(?/ '), will combine into one function to give C(r]( a ',r)( '), a more 
general functional dependence. However, these situations with more than one eigen- 
vector are not limited to direct sums. For instance, they occur in semidirect sums. 
In Section 5.6 we will see examples of both cases. 

5.3 Local Casimirs for Solvable Extensions 

In the solvable case, when all the W^' 7 s are lower-triangular with vanishing eigenval- 
ues, a special situation occurs. If we consider the Casimir condition (5.5), we notice 
that derivatives with respect to £ n do not occur at all, since W^ n ' = 0. Hence, the 
functional 



C[£] = [ fV) <5(x - x') dV = £ n (x) 
Jn 



is conserved. The variable £ n (x) is locally conserved. It cannot have any dynamics 
associated with it. This holds true for any other simultaneous null eigenvectors 
the extension happens to have, but for the solvable case £ n is always such a vector 
(provided the matrices have been put in lower-triangular form, of course). 
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Hence, there are at most n — 1 dynamical variables in an order n solvable 
extension. An interesting special case occurs when the only nonvanishing Wt„\ is 
for /i = n. Then the Lie-Poisson bracket is 

n-l 



It, A. ,. 

{f,g}=y^ WrT I r(x) 

^=1 Jn 



SF 



5G 



<5£m( x ) ' ,j^(x) 



d 2 x, 



where £ n (x) is some function of our choosing. This bracket is not what we would 
normally call Lie-Poisson because £™(x) is not dynamical. It gives equations of 
motion of the form 



f = W n v » 



6 JL «. 

6^ 



which can be used to model, for example, advection of scalars in a specified flow given 
by £ ra (x). This bracket occurs naturally when a Lie-Poisson bracket is linearized [58, 
69]. 

5.4 Solution of the Casimir Problem 

We now proceed to find the solution to (5.5). We assume that all the W^\ [i = 
0, . . . , n, are in lower-triangular form, and that the matrix W^> is the identity matrix 
(which we see saw can always be done). Though this is the semidirect form of the 
extension, we will see that we can also recover the Casimir invariants of the solvable 
part. We assume v > in (5.5), since v = does not lead to a condition on the 
Casimir (Section 5.1). Therefore W\ nu = 0. Thus, we separate the Casimir condition 
into a part involving indices ranging from 0, . . . , n — 1 and a part that involves only n. 
The condition 



becomes 



A=0 






0, v > 0, 



n-l 



n-l 



Y wx^c^ [e , r j + y w ^ v c 

k U,<T=0 u=0 



fin 



e,e 



o. 
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where we have used W\ nu = to limit the sum on [i. Separating the sum in A, 



n— 1 

£ 

A=0 



n—l 



n— 1 



.fm 



e\r 



n—l 



E W ^ C ,^ [e,e\ +J2 w x^ c ,i 

k fl,(7=0 /1=0 

n— 1 

+ ^ w^c ^ [ e" , r ] + E wr n' M, c )Atn [ r , r ] = o. 

U,CT=0 U=0 

The last sum vanishes because [ £ n , £ n ] = 0. Now we separate the condition into 
semisimple and solvable parts, 

n— 1 f n—l n— 1 



M=l 



E E ^""c,^ [ e , e ] - E w ^^ n \ c , c i 



k A,ct=0 



(T=0 



n-1 



+ e Wn^c,^ [ r , r ] + E w >°%°° 



a=0 



n-1 



A,ct=0 



e\f 



n—l 



n—l 



- e h^c, „ [ c , r ] + E ^°To. [ c , r ] = o. 

(7=0 (T=0 

Using Wo- = 5^, we can separate the conditions into a part for v = n and one 
for < v < n. For v = n, the only term that survives is the last sum 

n-1 

E c Artr,n = o. 

(7=0 

Since the commutators are independent, we have the conditions, 



C, Off = 0, a = 0, ... ,n- 1. 



(5.16) 



and for < f < n, 

n— 1 f n—l 



«=1 



n-1 



E E wvx«r [ ? A ^ ] - E ^""c,^ [ r , r ] 



kA,cr=l 



(T=l 



n-1 



c n[r,n = o, 



(T=l 

where we have used (5.16). Using independence of the inner brackets gives 

WtTC,^ = w^c^ ( 5 - 17 ) 

g^C^ = W^C^ n + 5% C on, (5.18) 
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for < a, A, v, \x < n. From now on in this section repeated indices are summed, and 
all greek indices run from 1 to n — 1 unless otherwise noted. We have written a tilde 
over the W's to stress the fact that the indices run from 1 to n — 1, so that the W 
represent a solvable order (n — 1) subextension of W. This subextension does not 
include Wi n y We have also made the definition 

gT := WrT- (5-19) 

Equation (5.17) is a Casimir condition: it says that C is also a Casimir of W. We 
now proceed to solve (5.18) for the case where g is nonsingular. In Section 5.4.2 
we will solve the singular g case. We will see that in both cases (5.17) follows from 

(5.18). 

5.4.1 Nonsingular g 

The simplest case occurs when g has an inverse, which we will call g . Then 
Eq. (5.18) has solution 

C,r„ = A£ ff C ilin + g T<T C i o n , (5-20) 

where 

A^:=g TV W^. (5.21) 

We now verify that Ara = A„ T , as required by the symmetry of the left-hand side 
of (5.20). 

A^ = g TU W K ^S a K 

= g TU W K ^g^g ap 

n 

K = \ 
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where we used the fact that W n pn = to extend the sum. Then we can use the 
commutativity property (2.21) to interchange p and v, 



A^ = g T JY,W K ^W n ^)g 



up 



= 9rvW n ™g ap W K ^ 
= 5t K A» k 

which shows that A is symmetric in its lower indices. 

In (5.20), it is clear that the nth variable is "special"; this suggests that we 
try the following form for the Casimir: 

c(e,e,-..,e) = ^2W(e,e,-..,e~ 1 )Me), (5.22) 

where / is arbitrary and fi is the ith derivative of / with respect to its argument. One 
immediate advantage of this form is that (5.17) follows from (5.18). Indeed, taking 
a derivative of (5.18) with respect to £ A , inserting (5.22), and equating derivatives 
of / leads to 






)l\ 



where we have used (5.16). Since the left-hand side is symmetric in A and a then so 
is the right-hand side, and (5.17) is satisfied. 

Now, inserting the form of the Casimir (5.22) into the solution (5.20), we can 
equate derivatives of / to obtain for r, a = 1, . . . ,n— 1, 



m = o. 



(°> - 

, x ( 5 - 23 ) 

T){l) — AU T)(l-1) i r. T)^- 1 ) „■ > 1 

The first condition, together with (5.16), says that T>^> is linear in £ , . . . £ n_1 . There 
are no other conditions on I?' ' , so we can obtain n independent solutions by choosing 

D(°)"=£", i/ = 0,...,n-l. (5.24) 
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The equation for pC 1 )" is 

^ = fc V = ": 1 (5-25) 

\ A ra V = l,...,Tl-l. 

Thus £>W 1 ' is a quadratic polynomial (the arbitrary linear part does not yield an 
independent Casimir, so we set it to zero). Note that T>( l > v does not depend on £° 
since r, a = 1, . . . , n — 1. Hence, for i > 1 we can drop the Oq ' term in (5.23). 
Taking derivatives of (5.23), we obtain 

We know the series will terminate because the W^' , and hence the At„\, are nilpo- 
tent. The solution to (5.26) is 

v {i)v = T^yy 4^.... (i+1) e ri r 2 • • • e^ , i > i, (5.27) 

where the constants D are defined by 

In summary, the £> w 's of (5.22) are given by (5.24), (5.25), and (5.27). 

Because the left-hand side of (5.26) is symmetric in all its indices, we require 

A^A^ X = A^A^, i>l. (5.29) 

This is straightforward to show, using (2.21) and the symmetry of A: 

A TG ^flX — A OT ^X P 

= (g (TK w r K n(9x P w/n 

= g aK g Xp W T ^W^ 
= 9a.9x p W T ^W^ 
— A^ A v 

TX IMJ 

If we compare this to (2.21), we see that A satisfies all the properties of an extension, 
except with the dual indices. Thus we will call A the coextension of W with respect 
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to g. Essentially, g serves the role of a metric that allows us to raise and lower indices. 
The formulation presented here is, however, not covariant. We have not been able to 
find a covariant formulation of the coextension, which is especially problematic for 
the singular g case (Section 5.4.2). Since the coextension depends strongly on the 
lower-triangular form of the W^> 's, it may well be that a covariant formulation does 
not exist. 

For a solvable extension we simply restrict v > and the above treatment still 
holds. We conclude that the Casimirs of the solvable part of a semidirect extension 
are Casimirs of the full extension. We have also shown, for the case of nonsingular g, 
that the number of independent Casimirs is equal to the order of the extension. 

5.4.2 Singular g 

In general, g is singular and thus has no inverse. However, it always has a (symmetric 
and unique) pseudoinverse g„ v such that 

9^9 aT 9rv = 9^v, (5-30) 

sr§ a r9 TV = sr- (5-3i) 

The pseudoinverse is also known as the strong generalized inverse or the Moore- 
Penrose inverse [79]. It follows from (5.30) and (5.31) that the matrix operator 

P\:=g™g KT 

projects onto the range of g. The system (5.18) only has a solution if the following 
solvability condition is satisfied: 

P\ {W^C^n + 6\ Con) = W a ^C ifin + 6% C „; (5.32) 

that is, the right-hand side of (5.18) must live in the range of g. 

If C on 7^ 0, the quantity W a Vil C^ n + 5 U ' a C$ n has rank equal to n, because the 
quantity W a vtl C un is lower-triangular (it is a linear combination of lower-triangular 
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matrices). Thus, the projection operator must also have rank n. But then this 
implies that g has rank n and so is nonsingular, which contradicts the hypothesis 
of this section. Hence, Con = for the singular g case, which together with (5.16) 
means that a Casimir that depends on £° can only be of the form C = /(£°). However, 
since £° is not an eigenvector of the W^'s, the only possibility is C = £°, the trivial 
linear case mentioned in Section 5.1. 

The solvability condition (5.32) can thus be rewritten as 

{P\ W^ - W a ^) C pn = 0. (5.33) 

An obvious choice would be to require P V T W ^^ = W a v ^, but this is too strong. We 
will derive a weaker requirement shortly. 

By an argument similar to that of Section 5.4.1, we now assume C is of the 
form 

c(e,...,e) = ^ {i He,...,e- 1 )MC), (5.34) 

where again fa is the ith derivative of / with respect to its argument. As in Sec- 
tion 5.4.1, we only need to show (5.18), and (5.17) will follow. The number of 
independent solutions of (5.18) is equal of the rank of g. The choice 

V^ u = P u p e, v = l,...,n-l, (5.35) 

provides the right number of solutions because the rank of P is equal to the rank 
of g. It also properly specializes to (5.24) when g is nonsingular, for then P u p = b v ' p . 

The solvability condition (5.33) with this form for the Casimir becomes 

{P\ W a Tfl - W a Ufl ) 2?W" = 0, i > 0. (5.36) 

For i = the condition can be shown to simplify to 

P\W a T> " = W CT UT P^ T , 
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or to the equivalent matrix form 

PW {a) =W {(T) P, (5.37) 

since P is symmetric [79]. 

Equation (5.18) becomes 

g*»V$> = o, 

g*n<pV£ = W K ' U D§- 1 >, i > 0. 

If (5.33) is satisfied, we know this has a solution given by 

V% = 9x P ^ V[fT 1)u + (*a" - g Xp 9 P ») Sfc 1 *, i > 0, 

where £ is arbitrary, and (5^ — 9x P 9 p ^) projects onto the null space of g. The left- 
hand side is symmetric in A and a, but not the right-hand side. We can symmetrize 
the right-hand side by an appropriate choice of the null eigenvector, 



in which case 



v ( $;=ALv i r 1)v , *>o, 



where 

A u Xa := g ffp wr + g Xp Wa pv - g Xp g„ K g pt " W™ , (5.38) 

which is symmetric in A and a. Equation (5.38) also reduces to (5.21) when g is 
nonsingular, for then the null eigenvector vanishes. The full solution is thus given in 
the same manner as (5.26) by 

(} + 1)! 1^-^+1) ? ? ? ' ' \O.GV) 

where the constants D are defined by 

nW" — 4M1 4M2 . . . 4 / *(*- 1 ) AVi p v (K A(\\ 
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and 2?( ' is given by (5.35). 

The ^4's must still satisfy the coextension condition (5.29). Unlike the non- 
singular case this condition does not follow directly and is an extra requirement in 
addition to the solvability condition (5.36). Note that only the i = case, Eq. (5.37), 
needs to be satisfied, for then (5.36) follows. Both these conditions are coordinate- 
dependent, and this is a drawback. Nevertheless, we have found in obtaining the 
Casimir invariants for the low-order brackets that if these conditions are not satis- 
fied, then the extension is a direct sum and the Casimirs can be found by the method 
of Section 5.2. However, this has not been proved rigorously. 

5.5 Examples 

We now illustrate the methods developed for finding Casimirs with a few examples. 
First we treat our prototypical case of CRMHD, and give a physical interpretation of 
invariants. Then, we derive the Casimir invariants for Leibniz extensions of arbitrary 
order. Finally, we give an example involving a singular g. 

5.5.1 Compressible Reduced MHD 

The W tensors representing the bracket for CRMHD (see Section 2.2.4) were given 
in Section 2.3.2. We have n = 3, so from (5.19) we get 

In this case, the coextension is trivial: all three matrices A^ v > defined by (5.21) 
vanish. Using (5.22) and (5.24), with v = 1 and 2, the Casimirs for the solvable part 
are 

C 1 = e 9(f) =vg{i>), C 2 = e h(f) =phty), 

and the Casimir associated with the eigenvector £ 3 is 

C 3 = k(f) = fcty). 
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Since g is nonsingular we also get another Casimir from the semidirect sum part, 

Pe Pe 

The physical interpretation of the invariant C 3 is given in Morrison [68] and 
Thiffeault and Morrison [90]. This invariant implies the preservation of contours 
of ip, so that the value ipQ on a contour labels that contour for all times. This is a 
consequence of the lack of dissipation and the divergence-free nature of the velocity. 
Substituting C 3 (tp) = ip k we also see that all the moments of the magnetic flux are 
conserved. By choosing C 3 (t/0 = 0(V'( X ) — tpo), a heavyside function, and inserting 
into (5.4), it follows that the area inside of any ^-contour is conserved. 

To understand the Casimirs C 1 and C 2 , we also let g(ip) = @(ip — ipo) in C 1 . 
In this case we have 

C>;V>] = f vg(ip)d 2 x = [ v(x)d 2 x, 

where ^o represents the (not necessarily connected) region of £1 enclosed by the 
contour ip = tfjQ and d^&o is its boundary. By the interpretation we gave of C 3 , the 
contour d*$>o moves with the fluid. So the total value of v inside of a ^-contour is 
conserved by the flow. The same is true of the pressure p. (See Thiffeault and Mor- 
rison [90] for an interpretation of these invariants in terms of relabeling symmetries, 
and a comparison with the rigid body.) 

The total pressure and parallel velocity inside of any -(/'-contour are preserved. 
To understand C 4 , we use the fact that uj = V 2 (p and integrate by parts to obtain 



C 4 [uj,v,p,ip] 



^(Wvv> + ^V(V0d 2 *. 



The quantity in parentheses is thus invariant inside of any ^-contour. It can be 
shown that this is a remnant of the conservation by the full MHD model of the cross 
helicity, 

V = [ v • B d 2 x , 
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at second order in the inverse aspect ratio, while the conservation of C 1 [v ;ip] is a 
consequence of preservation of this quantity at first order. Here B is the magnetic 
field. The quantities C 3 [V>] and C 2 \p;ip] they are, respectively, the first and second 
order remnants of the preservation of helicity, 

W = f A-Bd 2 x, 
Jn 

where A is the magnetic vector potential. 

5.5.2 Leibniz Extension 

We first treat the nilpotent case. The Leibniz extension of Section 4.5 can be char- 
acterized by 

Wx ^ = S x ^, n,u,X = l,...,n, (4.19) 

where the tensor 6 is an ordinary Kronecker delta. Upon restricting the indices to 
run from 1 to n — 1 (the tilde notation of Section 5.4), we have 

The matrix g is nonsingular with inverse equal to itself: g„ v = 5™ +u . The coextension 
of W is thus 

n— 1 n— 1 

Equation (5.28) becomes 

± ^T 1 T 2 ...T {l+1) J ^T 1 T 2 ^H1T 3 " r V*(i-2)T» ^/i(i_l)T( i+ l) 
r/il+n eM2+™ r^(»-l)+n <rv+n 

~ °t 1 +t 2 °m+T 3 ' ' ' °H(i- 2 )+n °/*(i_i)+T(i+i) 

— j) u + in ;/ — 1 n — 1 

which, as required, this is symmetric under interchange of the Tj. Using (5.22), (5.24), 
(5.25), and (5.27) we obtain the n — 1 Casimir invariants 

V(t\ ■ ■ ■ , H = £ 7-TTV C + i r 2+ -. + r (i+1) r ■ ■ ■ ^ +1 > /r(D, (5-42) 
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for v = 1, . . . ,n— 1. The superscript v on / indicates that the arbitrary function is 
different for each Casimir, and recall the subscript i denotes the ith derivative with 
respect to £ n . The nth invariant is simply C u (£ n ) = / n (£ n ), corresponding to the 
null eigenvector in the system. Thus there are n independent Casimirs, as stated in 
Section 5.4.1. 

For the Leibniz semidirect sum case, since g is nonsingular, there will be an 
extra Casimir given by (5.42) with u = 0, and the Tj sums run from to n — 1. This 
is the same form as the v = 1 Casimir of the order (n + 1) nilpotent extension. 

For the ith term in (5.42), the maximal value of any Tj is achieved when all 
but one (say, t\) of the Tj are equal to n — 1, their maximum value. In this case we 
have 

T\ + T2 H V n+i = r\ + i(n — 1) = v + in, 

so that Ti = i + v. Hence, the ith term depends only on (^ +i , . . . , £ ra ) , and the vth. 
Casimir depends on (£", . . . , £ n ). Also, 

max (ri H h r i+ i) = (i + l)(n — 1) = v + in, 

which leads to maxi = n — v — 1. Thus the sum (5.42) terminates, as claimed in 
Section 5.4.1. We rewrite (5.42) in the more complete form 

n—v 1 

r r , . . . , o = £ tj C + ( r 2 + 1)n + r, r ■ ■ ■ e /£_i(o, 
fc=i 

for i/ = 0, . . . , n. Table 5.1 gives the v = \ Casimirs up to order n = 5. 

5.5.3 Singular 5 

Now consider the n = 4 extension from Section 4.6.4, Case 3c. We have 




W {2) =000, g 
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n 


Invariant 


1 


/(e 1 ) 


2 


m 2 ) 


3 


em+ueme) 


4 


ef(?)+een?)+Mt 3 ) 3 f"(?) 


5 


m 5 ) + (ee + urn ne) + K 3 (e 4 ) 2 r (a + W) 4 f"(z 5 ) 



Table 5.1: Casimir invariants for Leibniz extensions up to order n = 5 {y = 1). The 
primes denote derivatives. 

with Wm = W(3) = 0. The pseudoinverse of g is g = g and the projection operator 

is 

/l 0\ 
P%:=g™g KT = 0. 
\0 1/ 

The solvability condition (5.37) is obviously satisfied. We build the coextension given 

by (5.38), which in matrix form is 

to obtain 

/0 0\ 
A& =0 1 , A& = aw = o. 
\0 1 0/ 

These are symmetric and obviously satisfy (5.29), so we have a good coextension. 

Using (5.34), (5.35), (5.39), and (5.40) we can write, for v = 1 and 3, 

C 1 = ?/(?)+???(?), 

This extension has two null eigenvectors, so from Section 5.2 we also have the 
Casimir /i(£ 2 ,£ ). The functions /, g, and h are arbitrary, and the prime denotes 
differentiation with respect to argument. 
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5.6 Casimir Invariants for Low-order Extensions 

Using the techniques developed so far, we now find the Casimir invariants for the 
low-order extensions classified in Section 4.6. We first find the Casimir invariants 
for the solvable extensions, since these are also invariants for the semidirect sum 
case. Then, we obtain the extra Casimir invariants for the semidirect case, when 
they exist. 

5.6.1 Solvable Extensions 

Now we look for the Casimirs of solvable extensions. As mentioned in Section 5.3, the 
Casimirs associated with null eigenvectors (the only kind of eigenvector for solvable 
extensions) are actually conserved locally. We shall still write them in the form C = 
/(£"), where C is as in (5.4), so they have the correct form as invariants for the 
semidirect case of Section 5.6.2 (for which they are no longer locally conserved). 

n=l 

Since the bracket is Abelian, any function C = C^ 1 ) is a Casimir. 

n=2 

For the Abelian case we have C = C^ 1 ,^ 2 ). The only other case is the Leibniz 
extension, 

-1 <-2\ tlfft2 



c(e,n = ef(n+g(n- 



n=3 



As shown in Section 4.6.3, there are four cases. Case 1 is the Abelian case, for which 
any function C = C(^ 1 ,^ 2 ,^ 3 ) is a Casimir. Case 2 is essentially the solvable part of 
the CRMHD bracket, which we treated in Section 5.5.1. Case 3 is a direct sum of 
the Leibniz extension for n = 2, which has the bracket 

[(a l ,a 2 ),((3 1 ,(3 2 )] = (0, [«i , /?i ]), 
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Case Invariant 



i c(e,e,e) 

3 ef(e)+g(e,e) 

4 em 3 ) + K£ 2 ) 2 /'(£ 3 ) + m 3 ) + Ma 

Table 5.2: Casimir invariants for solvable extensions of order n = 3. 

with the Abelian algebra [ as , (3% ] = 0. Hence, the Casimir invariant is the same 
as for the n = 2 Leibniz extension with the extra £ 3 dependence of the arbitrary 
function (see Section 5.2). Finally, Case 4 is the Leibniz Casimir. These results are 
summarized in Table 5.2. 

Cases 1 and 3 are trivial extensions, that is, the cocycle appended to the 
n = 2 case vanishes. The procedure of then adding £ n dependence to the arbitrary 
function works in general. 

n=4 

As shown in Section 4.6.4, there are nine cases to consider. We shall proceed out of 
order, to group together similar Casimir invariants. 

Cases la, 2, 3a, and 4a are trivial extensions, and as mentioned in Sec- 
tion 5.6.1 they involve only addition of £ 4 dependence to their n = 3 equivalents. 
Case 3b is a direct sum of two n = 2 Leibniz extensions, so the Casimirs add. 

Case 3c is the semidirect sum of the n = 2 Leibniz extension with an Abelian 
algebra defined by [ («3, ay) , (/%, $4) ] = (0, 0), with action given by 

P(ai,a 2 )(/ 3 3,/54) = (0, [ Ql , /3 3 ])• 

The Casimir invariants for this extension were derived in Section 5.5.3. 
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Case Invariant 



la C(£\£ 2 ,e\£ 4 ) 

lb m 4 ) + £ 2 s(£ 4 ) + e 3 M£ 4 ) + ^ 4 ) 

2 m 3 ) + e 2 s(a + M£ 3 ,0 

3a ef(e)+g(e,e,e) 

3b ^/(a + e 3 5(e 4 ) + Me 2 ,e 4 ) 

3c ^/(e 4 ) + eem 4 ) + e 3 5(e 4 ) + ne, e) 

3d ^/(e 4 ) + wfne) + e 9 (e) + ene) + ne) 

4a eue) + w?f (a +e 9 (e) + Ke,e) 

4b ef{e)+eer{e) + u??f"(^) 
+ eg{e) + \(e?g'{e) + eKe) + ne) 

Table 5.3: Casimir invariants for solvable extensions of order n = 4. 

Case 3d has a nonsingular g, so the techniques of Section 5.4.1 can be applied 
directly. 

Finally, Case 4b is the n = 4 Leibniz extension, the Casimir invariants of 
which were derived in Section 5.5.2. The invariants are all summarized in Table 5.3. 

5.6.2 Semidirect Extensions 

Now that we have derived the Casimir invariants for solvable extensions, we look at 
extensions involving the semidirect sum of an algebra with these solvable extensions. 
We label the new variable (the one which acts on the solvable part) by £ . In 
Section 5.4.1 we showed that the Casimirs of the solvable part were also Casimirs 
of the full extension. We also concluded that a necessary condition for obtaining a 
new Casimir (other than the linear case C(£°) = £°) from the semidirect sum was 
that det Wt n \ / 0. We go through the solvable cases and determine the Casimirs 
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associated with the semidirect extension, if any exist. 

n=l 

There is only one solvable extension, so upon appending a semidirect part we have 

Since det Wm ^ 0, we expect another Casimir. In fact this extension is of the 
semidirect Leibniz type and has the same Casimir form as the n = 2 solvable Leibniz 
(Section 5.5.2) extension. Thus, the new Casimir is just £°/(s )■ 

n=2 

Of the two possible extensions only the Leibniz one satisfies det Wr 2 \ 7^ 0. The 
Casimir is thus 

Csd = m 2 ) + ^ma. 

n=3 

Cases 2 and 4 have a nonsingular Wr%y The Casimir for Case 2 is 

c sd =ef(e)+eer(e), 

and for Case 4 it is of the Leibniz form 

n=4 

Cases lb, 3d, and 4b have a nonsingular Wuy The Casimirs are shown in Table 5.4. 
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Case Invariant 



3d m 4 ) + (^ 2 + ^(e 3 ) 2 ) /'(O + &k 2 )W) 

4b eat 4 ) + (^ 3 + ^(e 2 ) 2 ) ne) + heiefne) + uw(?) 



Table 5.4: Casimir invariants for semidirect extensions of order n = 5. These exten- 
sions also possess the corresponding Casimir invariants in Table 5.3. 



Chapter 6 
Stability 



In this chapter we discuss the general problem of stability of steady solutions of Lie- 
Poisson systems, for different classes of Hamiltonians. We first define, in Section 6.1, 
what we mean by a steady solution being stable. We review the different types 
of stability and discuss how they are related. In Section 6.2 we discuss the energy- 
Casimir method for finding sufficient conditions for stability, and demonstrate its use 
by a few examples. The energy- Casimir method for fluids uses an infinite-dimensional 
analogue of Lagrange multipliers to find constrained extrema of the Hamiltonian 
(extrema of the free energy) . 

In Section 6.3 we turn to a different method of establishing stability, that of 
dynamical accessibility. The technique involves restricting the variations of the en- 
ergy to lie on the symplectic leaves of the system. It is more general that the energy- 
Casimir method since it yields all equilibria of the equations of motion. The dynam- 
ical accessibility method is closely related to the energy- Casimir method, which we 
will see is reflected in the fact that the concept of coextension of Chapter 5 is used 
in the solution. 

For the different types of extensions, we derive as general a result as possible, 
and then specialize to particular forms of the bracket and Hamiltonian, until usable 
stability conditions are obtained. We will treat CRMHD in detail, using both the 
energy-Casimir and dynamical accessibility methods. 
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6.1 The Many Faces of Stability 

A somewhat universally accepted definition of stability is as follows: Let £ e be an 
equilibrium solution of the (not necessarily Hamiltonian) system 

i = HO, (6-i) 

i.e., J~(£ e ) = 0- The system is to said to be nonlinearly stable, or simply stable, if for 
every neighborhood U of £ e there is a neighborhood V of £ c such that trajectories £(£) 
initially in V never leave U (in finite time). 

In terms of a norm ||-||, this definition is equivalent to demanding that for 
every e > 0, there is a 5 > such that if ||£(0) — £ e || < S, then ||£(£) — £ e || < e for 
all t > 0. 

We also consider the linearized system, 

^ , (6.2) 



«-<«■« 



where <5£ is an infinitesimal perturbation. From this we define the formally self- 
adjoint linear operator 5 by 



{»il-«K):-(6 V ,^5C 



(6.3) 



From this definition we distinguish four basic types of stability: 

Spectral stability. The linearized system (6.2) is spectrally stable if the spectrum 
of the linear operator J defined by (6.3) has no eigenvalue with a positive 
real part. A special case is neutral stability, for which the spectrum is purely 
imaginary. Hamiltonian systems are neutrally stable if they are spectrally 
stable. 

Linear stability. If the linearized system (6.2) is stable according to the above 
definition, then the system (6.1) is said to be linearly stable (or linearized 
stable). This implies spectral stability. 
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Spectral Stability 



Linearized Stability 



Nonlinear Stability 



finite- 
dim 



Formal Stability 



Figure 6.1: Relationship between the different types of stability. 

Formal stability (Dirichlet criterion). The equilibrium is formally stable if we 
can find a conserved quantity whose first variation vanishes when evaluated at 
the equilibrium, and whose second variation is positive (or negative) definite 
when evaluated at the same equilibrium. In finite dimensions, this implies 
nonlinear stability. When the system is Hamiltonian and separable (i.e., it can 
be written as a sum of kinetic and potential energy), this criterion becomes 
Lagrange's theorem. 

Nonlinear stability. This is just the nonlinear stability of the full system as 
defined above. Note that this only implies that there exists a sufficiently 
small neighborhood V such that trajectories never leave U. It does not im- 
ply absence of finite- amplitude instability, called nonlinear instability by some 
authors, which says that the system is unstable for large enough perturbations. 



Figure 6.1 summarizes the relationships between the various types of stabil- 
ity. See Siegel and Moser [85], Holm et al. [38], or Morrison [69] for examples and 
counterexamples of these relationships. 

We have stated that formal stability implies nonlinear stability for finite- 
dimensional systems. Before discussing this point, we prove a stability theorem for 



finite-dimensional systems that has its origins with Lagrange. It was proved in a less 
general form than presented here by Dirichlet [53] , and was subsequently generalized 
by Liapunov. 

The theorem is as follows. If the system (6.1), in finite dimensions, has a 
constant of the motion & that has a relative extremum in the strong sense [27, p. 13] 
at the equilibrium point £ = £ c , then the equilibrium solution is stable. 1 

We follow the proof of Siegel and Moser [85, p. 208]. See also Hirsch and 
Smale for a thorough treatment [37]. Since & can be replaced by —R, we can assume 
it has a minimum without loss of generality. By the strong minimum hypothesis, 
there exists a g > such that 

£(&) < it(0 whenever ||£ - £J < g, (6.4) 

for some norm ||-||. Now, let 

We ■■={£{ ||£- £c|| <£}, <e<g, 

be a ball of radius e around the equilibrium point. Let /z(e) be the minimum value 
of & on the surface of the ball 9Jt e , 

fi(e) := min .£(£). 

||£-£e||=£ 

Using the strong minimum hypothesis, (6.4), we have 

£(£)</z(e), for£e9JT e . 
Now consider a trajectory with initial conditions £(0) in 9Jt e . Then 

*(£(*)) =*(£(<))) </z(e). 

But by continuity this implies £(£) G 9Jt £ since otherwise we would have had £(£(£)) > //(e) 
at some point in the trajectory. Thus, £(£) lies in 9Jt £ whenever £(0) does. We then 



1 In finite dimensions a strong minimum is just a minimum with respect to the usual Euclidean 
norm, ||(|| = |C|. 
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have stability, because Wl £ is a neighborhood of £ e and we can make e as small as we 
want. 

In finite dimensions, positive or negative definiteness of the second variation 
of & is sufficient for the strong minimum requirement (6.4). In infinite dimensions this 
is not the case [8,11,23,27,38,62,84]. Further convexity arguments must be made, as 
done for several physical systems in Holm et al. [38]. Another crucial requirement, 
which is immediate in finite dimensions, is that the invariant & be continuous in the 
norm ||-||. In general an infinite-dimensional minimum will not necessarily satisfy 
this condition [27,38]. 

Ball and Marsden [11] give an example from elasticity theory of a system that 
is formally stable but is nonlinearly unstable. Finn and Sun [23] discuss additional 
requirements for nonlinear stability of an ideal fluid in a gravitational field (for an 
exponential atmosphere), which is formally stable. One does not know how strin- 
gent these requirements are — they could be far from the actual instability threshold. 
We take the viewpoint here that establishing definiteness of the second variation — 
showing formal stability — is a good indicator of stability. Indeed, formal stability is 
often used to mean stability, as is the case with 5W stability criteria in MHD, which 
are actually second-order variations of the potential energy. For the Grad-Shafranov 
equilibria of reduced MHD (no flow) , the sufficient conditions for formal stability are 
the same as for nonlinear stability [38, pp. 41-43]. 

It will be the topic of future work to try and make these general stability 
conditions more rigorous by making more stringent convexity arguments. Certainly 
formal stability implies linearized stability, since the second variation of the constant 
of motion provides a norm (conserved by the linearized dynamics) that can be used 
to establish stability of the linearized system. 

Finally, note that Dirichlet's theorem does not imply that if F does not 
have an extremum at £ e , then the system is unstable. It gives a sufficient, but not 
necessary, condition for stability of an equilibrium. 



90 



6.2 The Energy-Casimir Method 

The energy- Casimir method has a long history which dates back to Fjortoft [24], 
Newcomb [93], Kruskal and Oberman [49], Fowler [25], and Gardner [26], but is 
usually called "Arnold's method" or "Arnold's theorem" [3-5,7,8]. We illustrate the 
method for a Lie-Poisson system. The equations of motion for the field variables £ 
in terms of a given Hamiltonian H are 



- e 



(2.6) 






t 



This can be rewritten 

i = - 

where C is any function of the Casimirs. It follows that if 

5{H + C)[Q=:5F[Q=^ 

then £ e is an equilibrium of the system. We call F the free energy. The free energy F 
is a constant of the motion whose first variation vanishes at an equilibrium point. 
Therefore, if we can show it also has a strong extremum at that point then we have 
proved stability, by the theorem of Dirichlet. Showing that S 2 F is definite (that is, 
showing formal stability) is almost sufficient to show stability, in the sense discussed 
at the end of Section 6.1. 

We now apply the energy- Casimir method to compressible reduced MHD. We 
will give more examples in Section 6.3 when we introduce the method of dynamical 
accessibility, which is more general and includes the energy- Casimir result as a special 
case. 

6.2.1 CRMHD Stability 

The free energy functional F is built from the Hamiltonian (2.15) and the Casimir 
invariants found in Section 5.5.1, 

F[u,v,p,iP]:=H + C, 
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C = (fM+vgty) +Ph(i>) + (ukW-p^pvtfty))) 

is a combination of the Casimirs of the system. We use the same angle brackets as 
for the pairing, without the comma, to denote an integral over the fluid domain (we 
assume that we have identified q and q*). 

Equilibrium Solutions 

We seek equilibria of the system that extremize the free energy. The first variation 
of F yields 

SF = /(-<£ + k(if>)) Sco + (v + gi^-Pe^pk'iip)) Sv 

+ (p c - 1 (p-2[3 c x) + h(7p)-(3 c - 1 vk'(i>)} 5p 
+ [-J + f'{i>)+vg , {ij)+ P ti^) + (u J k , {i;)-p c - 1 P vk"{i;))} 5 A 

An equilibrium solution (u) e ,v e ,p e , tp c ) for which 8F = must therefore satisfy 

<Pe = *(l/>e), (6.5) 

v e = (3 e - 1 p e &(ij e )-g(i> c ), (6.6) 

Pe = V e $'(Ve) + Pe(2x - h(l/> 9 )), (6.7) 

J e = f'(^ e )+v c g'(^ c ) +p c ti(^ c )+tu c &(ij e )-p e - l p e v e <S>"(^ e ), (6.8) 

where we have defined 3>(V0 := k(ip). 

Since <p e = <&(ip e ), we have V</> e = &(ip e ) Vip c . Hence, v c j_ = &'(ip e ) B e j_, so 
the perpendicular (poloidal) velocity and magnetic field are collinear at an equilib- 
rium. 

We can use (6.6) and (6.7) to solve for v c and p e , 

M _ (W{ip c )\ 2 y 1 ( g^c) + (h^ c )-2x)^(ip c ) \ 



Pc J V & J \g{^ c )<$>i{^ c )+f3 c {h{i) c )-2x) 
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except where ^'(^e)! 2 = Pe- This singularity represents a resonance in the system, 
about which we will say more later. Equation (6.9) implies 

(V^e - 2$'(V>e) (1 " A.- 1 !*'^)! 2 )" 1 *) X VVc = o, 
V Pc - 2/3 c (1 - &" 1 IS'tye)! 2 )" 1 *) X We = 0. 



An important class of equilibria are given by 

where c is a constant. We call those Alfvenic solutions. (The true Alfven solutions are 
the particular case with c = ±1.) We then have tu e $'(?/> e ) = Je/c 2 , and so from (6.8) 

U " ^) J e = /'(^e) +V e g\4> e ) + Pe ti(4> e ). (6.10) 

Note that, because of (6.9), the right-hand side of (6.10) depends explicitly on x, 
unless we have 

g'(^ e ) = -f3 e cti(^ e ), (6.11) 

in which case (6.10) simplifies to 

(* " ^) J c(^) = f'ty*) ~ </WO </(V>e)- (6-12) 

Such an equation, with no explicit independence on x, has an analogue in low-beta 
reduced MHD, but cannot occur for a system like high-beta reduced MHD [35, p. 59] 
without a vanishing pressure gradient. Here, with CRMHD, we can eliminate the x 
dependence because we can set up an equilibrium gradient in the parallel velocity 
which cancels the pressure gradient. 

If in (6.10) we let 

/'WO - g{l/>e)sf(ll>e) = (l " ^) exp(-2^e), 

then we have the particular solution 



ip e (x,y) = ln(acoshy + \/a 2 — 1 cosx). (6.13) 

This solution, the Kelvin-Stuart cat's eye formula [15,22,82], is plotted in Figure 6.2. 
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12 3 4 5 6 

Figure 6.2: Contour plot of the magnetic flux ip e {x, y) for the cat's eye solution (6.13), 
with a = 1.5. 

Formal Stability 

The second variation of F is given by 

5 2 F = l-Su {V 2 )' l 5uj + \5v\ 2 + ^-\5p\ 2 - Sip (V 2 )' 1 ^ + 2k'{ip) Su Sip 

+ [f"(i>) + vg"(4>)+ P h"(i>)+ujk"(4,)-p c - 1 P vk'"(4>)} \54>\ 2 

+ 2 (g'W-f3 c - 1 pk"(iP)) Si/jSv + 2 f h'(i/>) - AT 1 v fc"(V0) 6ip5p 

-2[5 c ~ 1 k l {^)5v5p). 



We want to determine when this is non-negative. Using 5lu = X7 2 5(f), we can write 

(\V5(f)\ 2 + |V#| 2 + 2fc'(V>) (V 2 5(f)) Sip) 

= (|V<50| 2 + |V#| 2 - 2V(*/(^) Sip) • V5(f) 

= (| V<ty - V(k'(iP) #)| 2 - |V(fc'(^) 5i))\ 2 + |V(#)| 2 ) , 



94 

which, after expanding the \V(k'(ip) 5ip)\ 2 term, becomes 

(\V5(f>\ 2 + |V#| 2 + 2*/^) (V 2 <50) <5^> = 

<|V<50 - V(A/(^) S^)\ 2 + (1 - |A;'(V>)| 2 )|V#| 2 

+ A/(V>)V 2 £; / (^)|<^| 2 ), (6.14) 

so that the second variation, evaluated at the equilibrium solution (6.5)-(6.8), is now 
S 2 F C = /|V<ty - V(*'(Ve) ^)| 2 + (1 - |$'(Vc)| 2 ) W| 2 + H 2 + 1|^| 2 

+ 2 (y(Ve)-ArV$"(^e)) <ty<fo + 2 (^e)-^- 1 ^*"^)) <ty *P 

+ e(x,y)^\ 2 -2p c - 1 ^^ c )5v5p\ (6.15) 
where 

9(x,y) := /"(&)+ Wty>e) +Pc^'(V'e) 

+ C^ c $"(^ e ) - &" 1 p c Vc $"'(Ve) + S'ty-e) Wty e ). 

For positive-definiteness of (6.15), we require 

|$'(^ c )| < 1. (6.16) 

If we have equality in (6.16), then we obtain a family of marginally stable equilibria, 
the Alfven solutions. 

Assuming (6.16) is satisfied, a sufficient condition for stability is to show that 
the (Sv, Sp, Sip) part of the second variation is non-negative. We thus demand the 
quadratic form represented by the symmetric matrix 

-f3 c - 1 $'(^ e ) &" 1 /l'(Ve) " AT* «c $"(Ve) 

be non-negative. A necessary and sufficient condition for this is that the principal 
minors of the matrix be non-negative. The principal minors are simply the deter- 
minants of the submatrices of increasing size along the diagonal. Thus, the first two 
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principal minors are 
IH = |1| >0, 

^2 = 



1 -Pe-'&ty, 



m^)\ 



i ( -Mi-^^)>o. 



-AT 1 $'(VO /3c" 1 

and the third is just the determinant of the matrix, 

Ma = ^2 (@(x,y) - [</(Y> e ) - A-^of"^)] 2 ) 

- [h'ty e ) + /3c" 1 </(V>e) *'(&) " AT^e + AT* Pe *'(&))] ' > 0. 

Combining (6.16) with the requirement [12 > 0, we have 

|$'(VO| 2 <min(l,/? e ). (6.17) 

According to this condition, for /? e < 1 CRMHD is less stable than the RMHD case. 
This is a direct manifestation of the nontrivial cocycle in the bracket: there is a new 
resonance, associated with the acoustic resonance, so-named because at that point 
the flow velocity equals the ion-acoustic speed (proportional to 2T e ). We will see in 
Section 6.3.6 that new resonances are a generic feature of Lie-Poisson systems with 
cocycles. 

The condition that ^3 be non-negative is of a more complicated form. For the 
Alfvenic case, with <&(ip e ) = c~ l ip e (x, y), and assuming condition (6.11), so that J e = 
J e (ip e ), the condition ^3 > simplifies to 

M2 (l " ^) Jc'(lpc) > 0. 

Since ^2 > and, by (6.17), 1/c 2 < min(l,/3 c ), we can simply write 

Jc'(V'c) > 0. (6.18) 

Hence, for /3 e > 1, Alfvenic solutions have the same stability characteristics as for 
RMHD. 
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6.3 Dynamical Accessibility 

We turn now to a different method of finding equiiibria and ascertaining their sta- 
bility. Finding the solutions for which the first variation of the free energy vanishes 
yields some, but not all of the equilibria of the equations of motion. For example, this 
method fails to detect the static equilibrium of the heavy top [69] . For the 2-D Euler 
system, the equilibria it yields are those for which the streamfunction is a monotonic 
function of the vorticity, but there are equilibria which do not have this form. This 
is tied to the rank-changing of the cosymplectic form: there are equilibria that arise 
because the bracket itself vanishes [69] . The method of dynamical accessibility was 
used by Morrison and Pfirsch to examine the stability of the Vlasov-Maxwell sys- 
tem [74,75]. Isichenko [43] made use of a similar method to study hydrodynamic 
stability, based on ideas of Arnold [6] . 

We first explain the method of dynamically accessible variations, and then 
apply it to extensions. We derive general results for pure semidirect extensions and 
extensions with a nonsingular g. For both cases, we examine several different types 
of Hamiltonians. 

6.3.1 The Method 

Consider a perturbation defined as 

<^da :={<?,£}, (6.19) 

with the perturbation given in terms of the generating function \ by 

The x are arbitrary "constant" functions (i.e., they do not depend on £, but do 
depend on x). We call (6.19) a dynamically accessible perturbation. The first-order 
variation of the Casimir invariant of the bracket is given by 

5C da = (su , ^) = ({Q >&>%)■ ( 6 - 2 °) 
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If we now assume that the bracket { ,} is of the Lie-Poisson type (Eq. (2.1)), we 
have 



SC, 



da 



x,£] f , 



SC_ 
~8£ 



£ ; 



X 






{G,C} = 0. 



Hence, to first order, Casimirs are unchanged by a dynamically accessible perturba- 
tion. The first-order variation of the Hamiltonian is 



<5#da = $F da 



5H 



ix,ZY,^ 



— Z 



t 



• X 



The variation of the Hamiltonian and of the free energy are the same because they 
differ only by Casimirs. If we look for equilibrium solutions by requiring that 5H^ a = 
for all X) we obtain 



SH 

-^7"(Ce),Ce 



0, 



which is equivalent to looking for steady solutions of the equation of motion (2.6). 

Because we want to establish formal stability, we have to take second-order 
dynamically accessible variations that preserve the Casimirs. If we denote the second- 
order part of the dynamically accessible variation by S 2 £,da, and the first and second 
order generating functions by \ an d X ■> we have 






i{{*». (}.%{{«».*} 



5£ 
1£ 



x 



x 



a) 



,{G {1 \t} 



5C_ 
c 






t sc\ A . .. 



s 2 ^-Uq {1) 



,{^,e}} 



6C_ 



We made use of the fact that (6.20) vanishes identically. In order for <5 2 Cd a to be 



zero, we can set 



V = {^} + ^\{^\e}} 



x (2) ,e 



+ \ 



X 



(1) 



x (1) ,e 



(6.21) 



The second-order dynamically accessible variation of H is 



S 2 H^ = \(s^,UL Sid }j + (p u , S JL 

+<K».f}+H 5<1, -{ e,1, '«}}'f/' 



which upon using (6.21) becomes 



<5 H d& — 2 
+ 






t 6 2 H r 



'6Z6£ 



x (1) ,e 



+ ^ 



X 



(1) 



x (1 \e 






The piece involving \ can be written as 
■ (2) ,lt SH 

J d£ 



- e 



.X 



(2) 



which vanishes when evaluated at an equilibrium of the equations of motion (6.3.1). 
Hence, for purposes of testing stability we may neglect the second-order generating 
function entirely. We therefore drop the superscripts on Q and x, and write 



5 2 K 



^-h([x,if,^:[x^) + h(\x,[x,^ MT 






'^ 



X: 






(6.22) 



To more easily determine sufhcient stability conditions, we want to write (6.22) as 
a function of <5£da- (Then (6.22) will be a quadratic form in <5£da-) We now show 
that this is always possible. This is a generalization of a proof by Arnold [5] for 2-D 
Eulcr. 

Assume that we have a dynamically accessible variation given in terms of a 
second generating function %', 



S'^,= [x'^]\ Ada = Hx',[x',£]' 



t 
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such that (5£da = 5'£da- Then the difference in the second order variation of the energy 
is 



25 H da — 25 H da 



5 2 H 



X- 



[x,£] t ,««[x,£] t + 
[x^^f|[x',£] f + 



6H_ 



, 5_H_ 



X 



\x,CV, 



x, 



5H_ 



[x,£] f , 



x', 



6H_ 



(6.23) 



Using (2.5) and the Jacobi identity in g, we have that for any a, /3, 7 G g and £ G g*, 

([«,e] f ,[/3,7]) = (^[«,[/3,7]]) 

= -(C,([/3,[ 7 ,a]] + [7,[a,/3]])) 

= ([ / 9,e] t ,[«,7])-([7,e] t ,[a,/3])- 

Making use of this identity in the last term of (6.23), we get 



25 H da - 25 i^da 



:x,^ j 



X- 



6H_ 



[x',e] f , 



X 



+ 



6H_ 

5£ ' ? 



, [ X , x' ] 



The first two terms cancel, and from (2.6) we are left with 



25 i^da — 25 i^da 



L [x,x'] 



which vanishes at an equilibrium of the equations of motion, for any \i x' ■ We 
conclude that 5 2 H dSi depends on \ only through 5£da- Thus, it is always possible to 
rewrite 8 2 H d& in terms of only the dynamically accessible perturbations x- 

6.3.2 2-D Euler 

An equilibrium of the equation of motion for 2-D Euler (see Section 2.2.2) satis- 
fies [4> c , w e ] =0. The most general equilibrium solution can thus be written 



4> e = $(it(x)); lo c = fi(u(x)), 
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where tt(x) is an arbitrary function. Contrary to the energy-Casimir result, neither 
the function <I> or 0, need be invertible (i.e., monotonic in their argument). 

We can then examine stability by taking the dynamically accessible second 
variation of the energy. This is given by (6.22) with [x,£ e ] = ~[x, UJ c], 

5 2 H da [u; c ] = ±(lx,u; c ]l(-V- 2 )lx,oJ^ -IxAc}) 



= i([x,w c ],(-V- 2 )[x,„c] + [x,0c]} 

= \ (|V(50 da | 2 + [x,w e ][x,<?f>e]) 

= ±(\V5^ da \ 2 + &(u)n>(u)[ X ,u] 2 }, 

where V 2 <5^>da := &^da- A sufficient condition for 5 2 H dai [uj c ] to be non-negative is 

<f>'(u)n\u)>0, (6.24) 

that is, the derivatives of <3? and O must have opposite signs. The energy-Casimir 
result is recovered by letting <£(u) = u, for then we have to e = £1(4> C ) and the stability 
condition is the usual Rayleigh criterion, f2'(</> e ) > 0. The stability result (6.24) 
obtained using the dynamical accessibility method is more general. 

6.3.3 Reduced MHD 

The equations of motion and bracket for RMHD are described in Section 2.2.3. The 
dynamical variables are (C ,^ 1 ) = (w,ip). 

Equilibrium Solutions 

We must first determine equilibrium solutions (u> e , ip e ) of the equations of motion (2.14), 
which must satisfy 

[w e ,4] + bPc,Jc] = 0, 
[lp e ,(p e ] = 0. 

To satisfy the second of these conditions we must have <p e = &(u), ip e = ^(u), with u = 
u(x). Using the fact that, for any g{x) and /(u(x)), 

[ 5 (x) , /(«)] = /'(«)[s(x) ,u] = [/'(«) 5 (x) ,u] , (6.25) 
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the first equilibrium condition can be written as 

[$'(«) W e -tt'(u)J ej u] =0. 

This is solved by 

Jc = ^rr~\ ' ( 6 - 26 ) 

W'('U) 

where T(n) is an arbitrary function. Note that this does not necessarily imply that cu e 
or J c are functions of u only 

Formal Stability 

Using the coadjoint bracket for extensions (5.2), the dynamically accessible pertur- 
bations are given by 

<^da= [xo,w] f + [xi^\ ] = ~[Xo,w] - [xi,i>], 

#da = [XoA] ] = -[XO,^]- 

The second-order variation of the Hamiltonian, (6.22), is 

S 2 H da [oJ c ; ^ e ] = \ {Su> da , (-V' 2 ) 5uJ da - [Xo , (f> e ]} 

+ \ (#da , (- V 2 ) <5^ da - [xo,J e ] - [Xl,4>c]) 
= \ <|V^ da | 2 + |V^ da | 2 ) 

- \ ([XO , </>e] Su da + [XO , Je] #da + [Xl , 0e ] ^da) 

=i<iv*,ihiv*^ + |;*,^) 

- \ <[X0 , Je ] #da + $' [Xl , U] <ty>da) , (6-27) 

where we have defined V 2 <50da : = <5wd a - Now we use 

[Xi,«] = ^7 (* / [Xi,w] + [Xo,w e ]) - — [xo,w e ] 
= -^7(^ da + [xo,w e ]), 
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to get 



5 2 H d3 [uj c ; V> c ] = \ ( | V(50da| 2 + | V# da | 2 + 2 — J^da ^d 



+ 



7 [X0,W e ] - [XO, Je]j Stpda) 



$' 



3 (|V^ da | 2 + |V#da| 2 + 2 — (tyda <^da 



*' 



1 



W P 



£1 

Xo,-j"7 



+ 



Xo,-j-y 



Sip, 



da 



where we substituted (6.26) to eliminate J e . To simplify the notation, we define the 
differential operator B by 



»/(») := 



' 4/W 



*'(«) du 



(6.28) 



so that 



<5 2 i^ da [u;e ; V>c] = 2 <|V^ da | 2 + |V^da| 2 + 2 B$ # da V 2 <^ da ) 

-i((w e [xo J B*] + [xo J BT])(yVda)- 



(6.29) 



Note that the first angle bracket in (6.29) is the same as (6.14), with k' replaced 
by B$. Hence, we can use identity (6.14) to obtain 

<5 2 #da = 2 < Wda " V(D$^ da )| 2 + (l " |B$| 2 ) |V^ d a| 2 ) 

+ \ ((B$ V 2 (B$) + uj c B 2 $ + B 2 T) |#da| 2 > • (6.30) 
Sufficient conditions for the perturbation energy (6.30) to be non-negative are [31] 



|B$| < 1, 
B$ V 2 (B<J>) + V 2 $ B 2 $ + B 2 T > 0. 



(6.31) 
(6.32) 



In the second expression we have substituted lu c = V 2< 3?. The first condition says 
that |$'(n)| < ^'(u)!, that is, the gradient of the magnetic flux is greater or equal to 
the gradient of the electric potential. This is a similar condition to (6.16), and says 
that the flow needs to be sub-Alfvenic to be formally stable [45]. This is due to the 
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well-known fact that the magnetic field provides a restoring force for perturbations 
of the flow, so that a large enough magnetic field can potentially stabilize the system 
(but not necessarily so, because the magnetic field can also have a destabilizing 
effect [18]). Indeed, condition (6.31) is actually necessary for positive-definiteness 
of 8 2 H& a . If we choose 5</>da = B<I> c'V'da in (6.30), then the first term vanishes. We can 
then pick a variation of 5ipd& with as steep a gradient as we want, while maintaining 
the value of Stpda bounded [27, p. 103]. This means that the |Vfo/>da| 2 term can 
always be made to dominate, so that we require |B3>| < 1 for positive-definiteness 
of 5 2 H da . 

This places a limitation on the method of dynamical accessibility: if we 
want to satisfy |D$| = <&' /vl/' < 1 everywhere, then on their domain of definition 
the zeros of ^ must also be zeros of <I> with equal or higher multiplicity. (However, 
the function <I> could potentially have more zeros than ^.) 

The simplest case is when ^ has no zeros, but then VP('u) is invertible, and 
we can recover the energy-Casimir result by solving for u = u(^f). In practice, this 
inversion may be difficult, and using the dynamical accessibility method is often 
easier. 

As an example, we will derive equilibria for magnetic islands with flow. Con- 
sider the RMHD equilibrium relation (6.26), multiplied by ^'(u), 

^'(u)J c - $'(«) lo c = T'(u). (6.33) 

where J e = V 2 ^(u). Using the fact that 

co c = V 2 $(u) = $'(«) V 2 u + $"(«) |Vu| 2 , (6.34) 

and the analogous relation for J c , we can rewrite (6.33) as 

((V) 2 - ($') 2 ) V 2 n + (*' tf" - $' $") \Vu\ 2 = T'(n), 

or equivalently 

((tf') 2 - ($') 2 ) V 2 u + i ((^') 2 " (^) 2 )' |Vu| 2 = T'(w). (6.35) 
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We can get rid of the | Vtt| 2 term, and make the equation easier to solve, by choosing 

(Choosing a different sign for the right-hand side would lead to solutions with B<I> > 1.) 
An obvious solution is 

ty'(u) = Kcosh(Vu), (6.36) 

$'(u) = Ksinh(i/«). (6.37) 

These satisfy |D$| = |tanh(z/u)| < 1, condition (6.31). 

Equation (6.35) becomes 

V 2 u = k- 2 T'(u), (6.38) 

to be solved for u(x). This equation has the same form as (6.12), which was an 
equation for Y> e (x), so it has the same Kelvin-Stuart cat's eye solution, 

u(x, y) = ln(a cosh y + \/a 2 — 1 cos x), 

with T'(-u) = k 2 exp(— 2u). The difference is that now the physical variables are 
given in terms of u by (6.36) and (6.37), so that the electric potential (and so the 
flow velocity) does not necessarily vanish, as opposed to the usual magnetic island 
solutions, which are recovered in the limit u = 0. The stability of the islands with 
flow could be very different, since now &' ^ in (6.29). However, as for the usual 
magnetic islands, the sufficient condition (6.32) is not satisfied, so that stability must 
be determined by test perturbations, or by direct numerical simulation [15,22,38,82]. 

6.3.4 Pure Semidirect Sum 

We now treat the general stability of the pure semidirect sum structure, with no 
cocycles (see Section 4.4). This structure is given simply by the n + 1 x n + 1 
matrices W^ ' = I, and W'w = o, // = 1, . . . ,n. We denote the 0th field variable 
by £° = w, and the remaining n variables by £*, . . . , £ n . 
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Equilibrium Solutions 

An equilibrium (zu e , {£,c}) of the equations of motion for a pure semidirect extension 



satisfies 



ii7 e = -[Fo, roc ] t -^[F M ,e] t = o, 

n=i 



i-v- 



(6.39) 
(6.40) 



C--[ff,o,e] t = 0, fi = l,...,n. 

To unclutter the notation, we assume that the first and second derivatives of the 
Hamiltonian H are evaluated at the equilibrium (zu c , {££*}), unless otherwise noted. 

We now specialize the bracket to the 2-D canonical one, Eq. (2.11), so 
that [,]'=—[,]. To satisfy condition (6.40), we require 

H = -*(«), e = S», fi=l,...,n, (6.41) 

for arbitrary functions $, S M , and u = u(x). (The choice of the minus sign for the 
definition of <& is purely a convention to agree with the sign of the streamfunction in 
2-D Euler, for which H q = SH/5co = —(p.) Condition (6.39) is then 



[<&(u),^ e ] + ^[i^,~>)]=0, 



or, using (6.25), 



which has solution 



a 






0, 



(6.42) 



¥(u)vj c + Y J H^'{u) = r{u). 

Equation (6.42) should be compared with (6.26), the equivalent solution for reduced 
MHD, for which n = 1 and H A = 5H/5ip = -J. 

Now that we have the equilibria, using (5.2) we write down the dynamically 
accessible perturbations 

n 

Sw da = [ X0 ,^ + J2[x,,^}\ (6.43) 



v=\ 



K 



da 



[Xo,£ 



Mlt 



n = i, 



(6.44) 
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and from (6.22) we get the second-order dynamically accessible variation of the 
Hamiltonian, 



<5 2 #da = \ ( 5^da , #,00 <^da + E H fifl S^ + [ X0 , # ( 



ra n 

+ EK^a'E F ^^a + ^0^da + [X0,^] + [^,#0]). 

Because the second-order functional derivative is formally a self-adjoint operator, we 
have the identity 

(5zu da , #0^ <5£da) = (<^da ' H ,^ Sw da ) , 

which we use in 5 2 H da to combine two terms and obtain 

n 

S 2 H da _ = \{5w da , , #00 5m da + 2 ^ # 0m <S££ a + [xo, H ,o] 

n n 

+ EK^a'E^^^da + [X0,^] + [^,#0]). (6-45) 

Using the equilibrium solution (6.41), the dynamically accessible variations given 
by (6.44) can be rewritten 

^ = -S^(«)[ X o,u] 

Observe that the perturbations of the £** are not independent: they all depend on a 
single generating function, xo- We choose to write all the variations in terms of 52^. 
We define 

V(x):=ax), ¥(«):= S», 

to explicitly show the special role of £ n . Then we have 

<^da = ^TTT ^da = BS" <tyda, (6-46) 

where we have used the previous definition of the operator ID, 
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Note that DH n = D\& = 1. We could have chosen any field instead of £ n , but in Sec- 
tion 6.3.5 this particular choice will prove advantageous due to the lower-triangular 
structure of our extensions. 

The dynamically accessible variations must obey the constraints of the sys- 
tem, that is they must lie on the coadjoint orbits. We have already discussed briefly 
this property of the semidirect sum in Section 3.4. 

To illustrate the situation we consider the equations of motion for a finite- 
dimensional semidirect sum, specifically a semidirect sum of the rotation group SO(3) 
(associated with our old friend the rigid body) with M 3 (see Section 2.2). We take w 
to be £, the angular momentum vector, with Hamiltonian H given by the usual 
kinetic energy, Eq. (2.10). The variables £^ are three- vectors, and their equations of 
motion are given in terms of the bracket (2.9) by 

(6.47) 
= (rt) x f . 

Note the angular momentum I is analogous to the vorticity to, and I^ 1 t is analogous 
to the streamfunction <f> = V~ 2 u>. Equation (6.47) says that the vector £** is rotating 
with the rigid body, keeping its length constant (the length of £^ is a Casimir). 
Thus, each £ M can be used to describe a point in the rigid body, such as the center of 
gravity. Adding a coupling term to the Hamiltonian can provide us with, for instance, 
a description of the heavy top in a gravitational field, but this would not change 
the form of (6.47). The point is that the £^ are constrained to rotate rigidly, and 
the dynamically accessible perturbations must obey the same constraint — they must 
depend on the perturbation applied to £, but by themselves there are no dynamically 
accessible perturbations that allow the £^ to change length or rotate independently. 
Physically, this makes sense, because we are not allowing the rigid body to have other 
degrees of freedom than the rotational ones. If we did, we would have to rethink our 
description, which would lead to different dynamically accessible perturbations; but 
within the confines of rigidity those perturbations make sense. 
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The situation in infinite dimensions is analogous to the rigid body. Here the 
typical case is an ideal fluid with passive scalars: we take w = w(x), the vorticity, and 
a Hamiltonian of the form H[u>] = —^ ((p > w )- The equations of motion for the £^(x) 
are given by 

t = -{H ,e? = -[<!>, e\- (6-48) 

Thus, the £^(x) are advected along by the fluid. The £^(x) can be used to describe 
passive scalars, since they do not enter the Hamiltonian (they do not affect the flow 
itself). An interaction term in H could describe, for example, the effect of tempera- 
ture on the flow in the Boussinesq approximation, but this would not modify (6.48): 
only the equation for Cj would change. Much like for the rigid body, the quantities ^ 
are constrained to move with the fluid, regardless of the form of the Hamiltonian. 
This is also true for the dynamically accessible perturbations of the £ M , which must 
then be induced by the perturbation on to. 

Formal Stability 

We now try to rewrite the second-order variation of the Hamiltonian (6.45) only in 
terms of dynamically accessible variations. We have, from (6.41), 

[X0,H fl ] =-$'(«) [xo,u] 
= B$(u) ^da- 
Prom the second line of (6.45), we can write 

n n 

n 

(1=1 
>$ (<5u7 da + [ X0 , W e ]) &/>da ' 
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where we have made use of (6.43) and (6.46). Finally, we have 

n n 

fi=i fi=i 

n 

in which we make use of (6.42) to obtain 









da I 



(6.50) 



X0 , W c ] (S^da 

The last term in (6.50) cancels part of (6.49), and we get 

n 

5 2 H d& = \{5w d& #oo Sw da + 2 ^2 <^ ro da H flfi <5^ a + 2B$ 5ro da J^da 

n n 

+ E ^a^^^ a +(D 2 $ OTc + ^^D 2 ^-D 2 T)|# da | 2 ). (6.51) 

Further progress cannot be made without assuming some particular form for the 
second-order functional derivative operator of H. 

Hamiltonian without operators 

The simplest case we can study is when H contains no differential or integral oper- 
ators. Then H^v is J us t a symmetric matrix. Using (6.46), we can simplify (6.51) 
to 



5 2 H da = ±(#oo |fe da | 2 + 2(V H 0lt BZ» + B$W da <^ da 



M=l 



+ ( J^ DH" U,^ BH" + B 2 $ Wc + Y, H^B 2 E" - B 2 TJ |^ da | 
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This can be rewritten as a quadratic form, 

5 2 H da = \ (5w da , 5ip d3i ) Q f s da 

where Q is the 2x2 matrix 

/ #oo tf^BH^ + BS \ 

" y H filt BS^ + B$ DH^ H^ v DH" + B 2 <I> ro c + F M B 2 ~^ - B 2 T J ' 

We assume repeated indices are summed from 1 ton. The matrix Q is non-negative 
if and only if its principal minors are non- negative, i.e., 

#,oo > 0, (6.52) 

det Q > 0. (6.53) 

Hence, to have formal stability it is imperative to have that the energy associ- 
ated with the perturbation of w be non-negative. Also note that the contribu- 
tion of (Hfl^ BH^ + B<&) is always destabilizing. For an equilibrium without flow 
(B<£ = 0) and with H t o u = 0, condition (6.53) reduces to 

BE^ H }/u/ DE" + H^B 2 E^ - B 2 T > 0. 

Advected Scalars 

We now treat the problem of advection of scalars. We shall not restrict ourselves 
to passive advection, and the form we choose for H is general enough to encompass 
systems with generalized vorticities, 2 such as the quasigeostrophic equations [39,94]. 

Let q denote the generalized vorticity, related to the stream function cf> by 

q = V 2 4>-F<P + f, (6.54) 

for some given functions .F(x) and /(x). Taking £° = w = q, we consider a Hamil- 
tonian 

H = (±(\v<p\ 2 +f ( f> 2 )+v( X ,q,e,---,e)) 
= (^-/)(^-v 2 r i (( 7 -/)+v(x, (? ,£ i ,...,n}, 



Also called the potential vorticity. 



Ill 

where V does not contain any operators. We have the first derivatives 

H = -<)> + Vo , Hj, = V,„ , 

and the second derivative operators 

J H",oo = (^-V 2 )- 1 +V 00 , 

Hflu = Vflu ■ 

Using identity (6.14), we can rewrite the first line of the second dynamically accessible 
variation of the energy (6.51) as 



da 



i(&/ da {{F - V 2 )- 1 + Voo) Sc/da + 2 (V 0m BH^ + D$) 5q d& 5^ 

= ^(<¥da {J 7 ~ V 2 ) <^ da + Voo l^daP " 2/C(u) 5Vda (^ " V 2 ^ 
= !(|V<tyda " V(/C # da )| 2 - /C 2 |V^ da | 2 + Voo l^dal 2 

+ T |<5</> da - /C #da| 2 + K (V 2 /C - TK) |# da | 2 ) (6.55) 



where 

£(«) := V fi n BZ»(u) + B$(u). (6.56) 

The term proportional to |V<5?/> da | 2 in (6.55) is negative definite unless we require an 
equilibrium with K.(u) = 0, that is 

V^BE^uJ+BSfa) = 0. 

Using K. = in (6.55) and writing out the rest of (6.51), we obtain 



<* 2 #da = |(|V^ da | 2 + ^|^ d a| 2 + V,00 l^dal 2 

+ (B 2 $ q c + DE" V^ BE U + V M B 2 Z» - B 2 t) |^da| 

For Hamiltonians with V^ = 0, the only equilibria for which we can demonstrate 
formal stability are ones without flow. If we assume this is the case, then from (6.42) 
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equilibria satisfy V iAt BH^(u) =DT(u). Note that / in (6.54) enters the stability 
expression through q c = V 2( 5 — J-$ + /. 

Combining (6.55) with the rest of (6.51) we have the the sufficient conditions 
for stability 

Voo > 0, 
BE^ V^ u OE u + V, l fi 2 s! x - B 2 T > 0, 

where we have assumed B<I> = so that K.{u) = Vo^BH^-u). This is the same 
stability condition as for a Hamiltonian without operators, (6.3.4), because we have 
chose a form of the Hamiltonian which decouples the operator part (kinetic energy) 
and the potential, so we get a Lagrange-theoremdike condition on the potential. 

RMHD-like System 

Another case of interest, a generalization of the RMHD system of Sections 2.2.3 
and 6.3.3, involves a Hamiltonian of the form 

F = i((|V0| 2 + ^0 2 )+2V(x,g,e 1 ,...,e^ 1 ^) + |V^| 2 >. (6.57) 

Here q, J-, and <j> are as i n the previous section in (6.54). As before, we have labeled £ n 
by ip as a reminder of its distinguished role: it enters the Hamiltonian as a gradient. 
(In this section greek indices run from 1 to n — 1.) The first derivatives of H are 
given by 

#o = -0 + V o , H, li = V J1 , H n = -J + V, n , (6.58) 

and the second derivative operators are 



(6.59) 



Hfio 


= (F-v z y 


_1 + Voo 


Hfl n — Vfln 


H,^v 


= v )A «, 




^ ,jxn — ',M n 


Hflv 


= Vflu 




■" ,nn — V r" V nn 
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The quantity J := V 2 ip is analogous to the electric current in RMHD. As before, 
we use H^it) to denote the equilibrium solution of £^ for fj, = l,...,n, and the 
equilibrium solution of £ n is written £™ = ^f(u). Also as done previously, we use the 
relation 

<yeSa = BH"(yVda, (6-46) 

where D is defined by (6.28). Adding the (V^dal 2 contribution to (6.55), we obtain 
|(<J?da ((.F - V 2 )- 1 + Voo) ^da + 2/C <5 9da <fy>da + Wdal 

1/XA. ('T Y72\ £,A i 1) _IX„. |2 or i./, . (T V72\JJ,. i IV7 X„l, . |2 



^<tyda (F " V 2 ) <50 da + Voo |<y?dar " 2 ^ <^da {? " V 2 )^ da + |V# da | 

|(|V^ da - V(/C# da )| 2 + (1 - /C 2 ) |V^ da | 2 + Voo |<^ da | 2 

+ .F |^da - /C# da | 2 + /C (V 2 /C - TK) |# da | 2 ) (6.60) 



where /C is defined by (6.56). The energy provided by the new (V^dal 2 term in 
the Hamiltonian (magnetic line-bending energy in MHD) allows us to have formally 
stable equilibria provided K? < 1. Thus, in contrast to the system in the previous sec- 
tion, there exist formally stable equilibria with flow even for a potential with Vo« = 0. 

6.3.5 Nonsingular g 

Now that we have demonstrated the procedure for obtaining equilibria and determin- 
ing their stability for brackets with no cocycles (Section 6.3.4), we are in a position 
to deal with the more complicated case of an arbitrary semidirect-type extensions 
with a nonsingular Wr n \ = g. We shall make heavy use of the concept of coextension 
introduced in Section 5.4.1. 
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Equilibrium Solutions 

First we must look for equilibria of the equations of motion, which from (2.6) and (5.2) 



arc 



zu c = 



. Hfl , ZU e 



[H0,g]*-[H n ,fa]\ 



A» it 



■[H fi ,& 



\HoMl 



W x 



in/ 



H V ,^Y -g^[H v ,^ 9 ]\ 



(6.61) 
(6.62) 
(6.63) 



Unless otherwise noted, in this section all greek indices take values from 1 to n — 1, 
and repeated indices are summed. The tensors W were defined in Section 5.4: they 
are the subtensors of W with indices restricted from 1 to n — 1. They form a solvable 
extension. We have also made use of the definition g^ v := W^a^ . As in Section 6.3.4, 
we have set the variable £ n apart and labeled it by ip, but now it does actually 
play a distinguished role in the solution of the problem, as it did in Section 5.4. 
Also note that the derivatives of the Hamiltonian are implicitly evaluated at the 
equilibrium (zu c , {£c }, V'e)- 

We now specialize the bracket to the 2-D canonical one, given by (2.12). 
Equation (6.63) is satisfied if 



H t0 = -*(«), ^ c = *(«), 



(6.64) 



for functions <1> and *&, and some u = u(x). Equation (6.62) is quite a bit dicier to 
solve. The trick is to use the lower-triangular form of the W^' to solve for the H v . 
We multiply (6.62) by g := g^ 1 , and use (6.64), to obtain 



Hu),g Tlx ^]+g rix W x ^ 



H,v , £ c 



+ g Tlx g^[H v ,y(u)}=0, 



or, using the definition (5.21) of the coextension, A" x := W T vti g^ 



[ H T tt'(u) + *'(«) g^ g,u]+ A» x H„ , £ 



0. 



(6.65) 
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Since the W"^'s are lower-triangular, the A^'s have the form 





A <r) 







v = 1, . . . ,n — 1, 



where the box represents a square (n — v — 1) -dimensional symmetric matrix of pos- 
sibly nonzero elements. There are never any nonvanishing elements in the first row 
of A^ v \ so setting r = 1 in (6.65) gives 



[H 1 *\u) + &{u)g lll g,u]=Q. 



(6.66) 



We write the solution as 



where k\{u) is an arbitrary function and the operator B is defined by (6.28). Equa- 
tion (6.65) with r = 2 is 



[H^'{u) + ^{u)g 2 ^,u}+A\ x 



H ,l j £e 



0. 



If we then substitute in the solution for Hi, Eq. (6.66), we have 

[H^^^ + ^^g^^^j+Al^h^-O^u)^^,^} =0. (6.67) 

Note that A\ x g ltl = A% x g Ufl = g 2K W X KV g Vil = 92k A \, 1 is symmetric in A and /x. 
Hence, 



A 2\ 



©<&(«) 0i„#>£l =92 K A K Xli \B$(u)^,£ 



Se > Se 



B$( U )g 2K A^ 

\m>'{u)g 2K AiAe c 



+ g 2K Al^e ©*(«), e 



U,Ce 



+ £[«,#] 



= \B$'{u)g 2K A^ 
We can now solve (6.67) for H 2 , resulting in 



«,££ 



A pfJ. 
c se 



H 2 = k 2 (u) - B$(u) g 2fX e + A« x Ok K (u) £ C A - \ B 2 $(u) g 2K A%, £ g, 
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where k-2{u) is another arbitrary function. The procedure carries on in the same 
manner for r > 2, and in general we have 

H T = k T {u) + E ^T ^H...a» & • • • £", (6-68) 






where 



Qg(«) := D (A p rX k p (u) - g rX *(«)) , (6.69) 



and 



qW . (u):=A T \ A T \ ■■■A Tm -\ A Tm -\ 

^>TXl—\m y ' TAX TlA2 T m _3A m _2 T m _2A m _i 



(<.-i;u M«) - g Tm _ lXm *(«)) , (6.70) 



for m > 2. If we define fco(u) := — $(u), we can also write the Q^ m ' in terms of the D 
tensors, defined by (5.28), as 

n— 1 

qM W:=y^.. B m kJu). 

^Al'"A m A m +i V / / / Ai---A m A m +i f\ / 

The sum in m in (6.68) terminates since the At^\ are nilpotent. The Q^ m '{u) are 
symmetric in all their lower indices. 

Note that (6.68) is not a closed-form solution for the equilibria: depending on 
the specific form of the Hamiltonian, the equation may be straightforward or difficult 
to solve, or possibly not have any solutions at all. The situation is the same as for 
Eqs. (6.6) and (6.7) (the energy- Casimir limit for CRMHD), which were solved for p e 
and v e in (6.9). 

The fact that the coextension, which we used to find Casimir invariants in 
Chapter 5, appears in this calculation is not surprising, since the energy-Casimir 
method result is recovered by letting ^f(u) = u, which simply says that ID is replaced 
by d/du. 
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We still have to satisfy (6.61) (zu e = 0) to get an equilibrium. Substituting 
in the results of (6.64) and (6.68), we get the condition 



[ *'(«) if n + $'(«) U7 e ,u] + 



k ^ u ) + E^Qit-xJ u )^---^,e c 



m>\ 



0. 



This can be solved, using the same techniques as for H^, . . . , H, n -\ above, to give 

£ A l . . . £ Am +l 



(m) 



H tn = k n - B<J> w e + Bkf, C^+Yj D( 2aT 



'Am. An 



m>l 



(m + 1)! 



(6.71) 



We now have expressions for the equilibria of arbitrary nonsingular extensions, given 
by (6.64), (6.68), and (6.71). We can proceed to determine their stability. 

Formal Stability 

The dynamically accessible variations are obtained from (5.2), and are just equations 



(6.43) and (6.44) modified appropriately, 



8vj. 



da 



[xvM ] + [x^e\ ] + [xnM\ 



K, = [xo,e\ ] +w x 



/.IAS 



xu,e 



t 



+ «r x*,^ 



it 



Sip, 



da 



'xo,^] 1 - 



(6.72) 

(6.73) 
(6.74) 



Notice that unlike the pure semidirect sum case given by (6.43) and (6.44), the 
dynamically accessible variations for ^ 1 , . . . ,£™ are now potentially independent. 

We can use expression (6.45) for 5 2 H d& of the pure semidirect sum, modified 
to admit a cocycle, 



5 2 H d& = \{5w d& , #oo Szu da + 2i?o M ^da + 2H fln #da + [xo , H i0 ] 
+ 5 (<^da > H ^ <^da + 2H ^ # da + [xo,H ll ] + [Xp,H ]+ W^ T [ Xa , H, 

+ ^(#da,#nn^a+[X0,#n] + [Xn,#0 ]+</"" [x<T,H, T ]). (6.75) 



As we did for the semidirect sum case, we want to express all the brackets in terms of 
dynamically accessible variations. We know we must be able do this by the theorem 
proved at the end of Section 6.3.1. 
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The starting point is the [ \n , Hfi ] term, since it contains \n and thus can 
only be expressed in terms of 5w dai , given by Eq. (6.72). We do not present the 
calculation in detail here because it involves a great deal of algebra, none of which 
is very illuminating. We have to make liberal use of the identity 

easily verified from the definition of Q( m > , Eq. (6.70). 

The final form of the second variation of the Hamiltonian is 



S 2 H dlx = \{5w d& H fi0 5w d& + 25w d& H >0fl 5^ a + 25w d& [H,on + B$j<5Vda 



m>l 



m>l 



da , 

,,,, -., ..„. , 

m>2 

(6.76) 

This very general expression allows us to see exactly where the cocycles modify the 
energy expression. Obtaining a useful result out of it is difficult, so we will do what 
we usually do: we simplify the problem! The case we will treat in more detail is the 
vanishing coextension case. 

6.3.6 Vanishing Coextension 

We consider the case where the coextension ^4 = but g is nonsingular, as is the case 
for CRMHD (see Section 5.5.1). A schematic representation of this type of extension 
is shown in Figure 6.3. Then from (6.69) we have 
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Figure 6.3: Schematic representation of the 3-tensor W for a semidirect extension 
with vanishing coextension (A = 0). The axes are as in Figure 4.2. The red cubes 
represent the n-lxn-1 matrix g^", assumed here nonsingular. Note that com- 
pressible reduced MHD, in Figure 2.1, has this structure. 

and from (6.70) we have Q^ m '{u) = for m>2. We still have tp e = ^(u), and the 
equilibrium relations (6.68) and (6.71) simplify to 



H T = k T -TD>$g rX g, 

H n = A; n - B$ ru c + Bkp e - \ B 2 $ g^x g £, 



(6.77) 
(6.78) 



where as in Section 6.3.5 the greek indices run from 1 to n — 1. The second order 
variation of the Hamiltonian, Eq. (6.76), "reduces" to 



<5 2 -ffda = l(<5wdA ffoo Swda. + 25m da , H t0fJt &^ a + 25m da ,(H fin + B$)#0da 

+ hiKM^ + m) *^)^da) + (<A H ,»n - b^ + B 2 $^ A e e A )#d a ; 

+ U5ip da (H nn - m n + B 2 $ w e - B 2 ^ ££ + | B 3 $^ #£)<tydaV (6.79) 



Again, to make progress we must further specialize the form of the Hamiltonian. 
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RMHD-like System 

Let us take the RMHD-like Hamiltonian (6.57). We first need to find the equilibria, 
which we accomplish by substituting (6.57) into the equilibrium conditions (6.64), (6.77) 
and (6.78), 

-& + V )0 = -$(«), (6.80) 

V, T = k T (u)-B<S>(u)g TX £, (6.81) 



- Jc + V, n = k n {u) - D$(u) g c + Bk^u) e - | © 2 $(u) ff MA e £ , (6-82) 



Since we have not specified the exact dependence of V on the £^, we cannot solve 
these for the £|\ For the pure semidirect sum case, we had £c = S(u), regardless of 
the form of the Hamiltonian. The presence of the nondegenerate cocycle leads to 
potentially much richer equilibria. 

For the perturbation energy, we can use the result (6.60) in (6.79) to obtain 



<* 2 #da = |(|V^0 da - V(/C <fyda)| 2 + (1 " K 2 ) |V# da | 2 + V,00 |<W 

+ T |<50 da - K # da | 2 + 2V, 0li <5g da 6& 

+ (V ^ + B<1> g,J) 5& Ca + 2 (V,„ n - D ^ + B 2 $ g^ &)6& «tyda 

+ ( V nn - Bfc n + D 2 $ g c - B 2 ^ e + iD 3 $ g^ gg + K (V 2 /C - .F/C) ) 

x |^da| 2 ), (6.83) 

where 

JC{u) :=V „ + B$(u). 

Immediately we see that the stability conditions |/C| < 1, F > 0, and V,oo > still 
hold. However, until we have a closed form for the equilibria we cannot make definite 
stability predictions. We now proceed to use a more restricted class of Hamiltonians 
for which the equilibria can be found explicitly. 
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Quadratic Potential 

An important case we can do explicitly is when V is quadratic, 

where 03 is a symmetric matrix, in which case we have 

and Vfl = V n = 0. Inserting this into (6.81), we obtain 

(2J rA +D$3 rA )£ c A = Av-tV. 

Assuming 23 is nondegenerate, the matrix 

W rA := 2J rA + B^ rA (6.84) 

will be invertible except possibly at some points. We denote its inverse by W rA , 
and (6.3.6) has solution 

£ c A (x) = W XT (k T (u) - r (x)). (6.85) 



We emphasize how different this expression is to the pure semidirect sum result, £ A (x) = H(it) 
In (6.85) the equilibrium solution £ A can explicitly depend on x through the Hamil- 
tonian. This can never occur for equilibria of the pure semidirect sum, regardless of 
the form of the Hamiltonian. 

The most interesting feature of the new equilibria (6.85) is the fact that 
there are new resonances in the system — solutions for which W Ar will blow up. 
This is what occurred for CRMHD in Section 6.2, where we had a singularity in the 
solution (6.9) of v e &ndp e , associated with the acoustic resonance. As the equilibrium 
solution approaches this resonance, we can expect the system to become less stable. 

We can use the solution (6.85) in (6.82) to obtain a closed-form result for J c , 

J e = -k n + B$ (V 2 $ - F§ + /) - Bk^ W^ T {k T - o T ) 

+ \ B 2 $ fa - T ) W^ ^ A W Act {k a -X> a ), (6.86) 
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where J c = V 2 ^(u). Using Eq. (6.34) for uj c and the analogous relation for J c , we 
have that (6.86) can be rewritten 



((tf')2 _ ($')2) ($'$"_$'$ 

v y V 2 -u+- — '-\Vu 



2 



-£:„ + B$ (-.F $ + / ) - B£i M W^ T (fc T - D T ) 

+ 1 D 2 $ (fc T - t» T ) W r " g^ x W Xa (k a - tv). (6.87) 

This is a nonlinear PDE to be solved for u(x) with arbitrary functions 3>(u), ^f(u), 
and kfj,(u), and given functions W T/i (x), t) CT (x), .F(x), and /(x). Needless to say, 
solving (6.87) in general is extremely difficult. There are, however, classes of solution 
that can be obtained analytically. We now examine one of these special cases. 

A particularly simple case are the aforementioned Alfven solutions, for which 

qf'(u) = c$'(u), (6.88) 

where c is a constant. We also obtain 

B ®( u ) = tttt ^(«) = ~^r^ $ 'H = c_1 . 

*'(«) du c$'(-u) 

so that B m $ = for m > 1. Thus, assuming that \£' and $' are proportional for 
the dynamical accessibility method is analogous to assuming that ^(ipc) is linear for 
the energy-Casimir method. From (6.88), we might be tempted to simply write <& = 
<I ) (^ / ), and indeed this is true. However, this is not useful because in general we 
still cannot rewrite u in terms of \l/, since \£ = ^>(u) may not be invertible. If VlJ is 
invertible, then we recover the energy-Casimir result completely. 

Using (6.88) in the equilibrium condition (6.87) gives 
(1 - c~ 2 ) (V V 2 u + *" | Vu\ 2 ) = -k n - c~ 2 (Tu - cf) - tip W» T (k T - o r ), (6.89) 

so that the quadratic term (proportional to ID <£>) disappears. 

Several systems have W^ T independent of x. It may then also happen that 
we can choose the k^(u) such that 

k' W^ T t> T = c- 2 (Tu - cf) , (6.90) 
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After this is effected, Eq. (6.89) no longer depends explicitly on x, and has solutions 
such as the Kelvin-Stuart cat's eye. This procedure can be carried out for CRMHD, 
for which T = f = 0. In that case, (6.90) becomes (6.11). 

Stability for Quadratic Potential 

Assuming that we still have the quadratic potential of the previous section, we now 
show that the (acoustic) resonance which occurred for CRMHD is a generic feature 
of Lie-Poisson systems with cocycles. 

We take the energy expression (6.83), use the fact the Vo = V,n = 0, and 
obtain 

5 2 H d& = i(|V<tyda - V(/C <^ da )| 2 + (1 - K 2 ) |V# da | 2 

+ T |<ty da - K # da | 2 + HV 6& 6& + 2(B 2 $ g^ ^ - ®kj6& # da 
+ (D^ge-B^-B^e + ^^^eC + ^CV'/C-^/C)) |^ da | 2 ), 

where we used the definition of W, Eq. (6.84), and we have not made any assumptions 
about the form of $ and ^. The equilibrium solutions £g satisfy (6.85). 

If we assume /C < 1 and J- > 0, then to obtain part of the sufficient conditions 
for stability we require that W be positive-definite. But when W becomes singular 
we cannot guarantee this. This was the case with CRMHD. 

Note that det W = does not imply that the system will be unstable beyond 
the resonance. It is, however, a strong indication that it might be. 



Chapter 7 
Conclusions 



Using the tools of Lie algebra cohomology, we have classified low-order extensions. 
We found that there were only a few normal forms for the extensions, and that 
they involved no free parameters. This is not expected to carry over to higher 
orders (n > 4). The classification includes the Leibniz extension, which we have 
shown is the maximal extension. One of the normal forms is the bracket appropriate 
to compressible reduced MHD. 

We then developed techniques for finding the Casimir invariants of Lie- 
Poisson brackets formed from Lie algebra extensions. We introduced the concept of 
coextension, which allowed us to explicitly write down the solution of the Casimirs. 
The coextension for the Leibniz extension can be found for arbitrary order, so we 
were able obtain the corresponding Casimirs in general. 

By using the method of dynamical accessibility, we derived general condi- 
tions for the formal stability of Lie-Poisson systems. In particular, for compressible 
reduced MHD, we found the presence of a cocycle could only make a certain class of 
solutions more unstable. In general, cocycles were shown to lead to resonances, such 
as the acoustic resonance for CRMHD. 

The dynamical accessibility approach also allowed us to get a clearer picture 
of the role of cocycles: in a pure semidirect extension, the absence of a cocycle means 
that the system necessarily describes an advective system, and the dynamically ac- 
cessible variations are not independent. In contrast, for the nonsingular cocycle case 
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all of the perturbations are independent. The form of the stability condition is thus 
much more complex. 

It would be interesting to generalize the classification scheme presented here 
to a completely general form of extension bracket [72, 77]. Certainly the type of 
coordinate transformations allowed would be more limited, and perhaps one cannot 
go any further than cohomology theory allows. 

Though we have gone a long way in this respect, the interpretation of the 
Casimir invariants has yet to be fully explored, both in a mathematical and a physical 
sense. Mathematically, we could give a precise geometrical relation between cocycles 
and the form of the Casimirs. The cocycle and Casimirs should yield information 
about the holonomy of the system. For this one must study extensions in the frame- 
work of their principal bundle description [21]. Physically, we would like to attach a 
more precise physical meaning to these conserved quantities. The invariants associ- 
ated with simultaneous eigenvectors can be regarded as constraining the associated 
field variable to move with the fluid elements [68]. The field variable can also be 
interpreted as partially labeling a fluid element. Some attempt has been made in 
formulating the Casimir invariants of brackets in such a manner [52,90], but for the 
more complicated invariants a general treatment is still not yet available. 



Appendices 
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Appendix A 



Proof of the Jabobi Identity 



We want to show that the Lie-Poisson bracket 

{F,G} ± (0 = ±U 



6F_ 5G_ 



(2.1) 



where £ € g*, and F : g* 
identity 



and G : g* 



are functionals, satisfies the Jacobi 



{{F,G} ± ,H} ± + {{G,H} ± ,F} ± + {{H,F} ± ,G} ± = 0. 

The inner bracket [ , ] is the bracket of the Lie algebra g, so it satisfies the Jacobi 
identity. The overall sign of the bracket is inconsequential, so we choose the + 
bracket. We first compute the variation of {F , G}, 



6{F,G}^(6Z, 
8H, 



SF_ 5G 
~Sj , ~6j 

6F_ SG 
SF_ SG 



+ e 



S 2 F SG 

SjS£ ^' &f 



+ <*> 



— — Sf 



5^ ' ? 



U 2 F 
'$£<$£ 



<*£} + 



- £ 



U 2 G 



^ 



5 2 F 



SG_ 



£ 



+ 



5 2 G 



5F 



£ 



t* 



where we have used the definition of the coadjoint bracket (2.3) and the self-adjoint 
property of the second derivative operator. Thus, we have 



S{F,G} 



S£ SG_ 



5 2 F 

sjsi 



SG 



t 



+ 



S 2 G 
S£S£ 



- e 



t 
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We can now evaluate the first term of the Jacobi identity, 

6{F,G} 5H 



{{F.G}.n\ = {£, 



6£ ' 6£ 
6£ SG 



5F_ SG 
~5£ '~Sj 



6 2 F 
5_H_ 



SG 



+ 



S 2 G 



sffi 



SF 



■A 



SH_ 
~Sj 



A 



t X2 



S 2 G 
8£6Z 



5_F_ 

1£ 



+ 



,£ 



— i 



f 5 2 F 



5£5i 



— 

~sj^ 



f SH 

'If 



Upon adding permutations of F, G, and H, the second-derivative terms cancel and 
we are left with 



£, 



S_F_ SG_ 
~Sj , ~Sj 



SH_ 
'If 



+ 



SG SH_ 



5£ 



+ 



SH 5F 



SG 



which vanishes by the Jacobi identity in q. 



Appendix B 
Proof of W^ = I 



Out goal is to demonstrate that through a series of lower-triangular coordinate trans- 
formations we can make W^ l > equal to the identity matrix, while preserving the 
lower-triangular nilpotent form of W^ 2 \ . . . , W^ 1 ' . 

We first show that we can always make a series of coordinate transforma- 
tions to make W\ = 5\ . First note that if the coordinate transformation M is 
of the form M = I + L, where / is the identity and L is lower-triangular nilpotent, 
then W^ = M _1 W^ M still has eigenvalue 1, and the matrices 



W&* = M- 1 W M M, n > 1 



are still nilpotent. 

For A > 1 we have 



A-l 



W x u = W X U + W x lu L v l = W X U + Y, W x lv L u l + L x \ (B.l) 



v=2 

1A 



where we used W\ = 1. Owing to the triangular structure of the set of equa- 
tions (B.l) we can always solve for the L\ to make VF^ 11 vanish. This proves the 
first part. 

We now show by induction that if W\ = 5\ , as proved above, then the 
matrix W^ 1 ' is the identity. For A = 1 the result is trivial. Assume that W^ v = 5^, 
for ji < A. Setting two of the free indices to one, Eq. (2.21) can be written 

Wx^ 1 W^ la = W x ^ a W^ 11 

= Wx^ V = W x la . 
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Since W^ l > is lower-triangular the index ju runs from 2 to A (since we are assum- 
ing A > 1): 

A 
/u=2 

and this can be rewritten, for a < A, 

A-l 

^2 W x ^ Wf, la = . 
Finally, we use the inductive hypothesis 

A-l 

J2w^ 1 s^ = w x - 1 = o, 

ii=2 

which is valid for a < X. Hence, W\ a = 5\ a and we have proved the result. (W\ 
must be equal to one since it lies on the diagonal and we have already assumed 
degeneracy of eigenvalues.) 
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